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SEQUENCE CHARACTERISTICS OF BLADDER CANCER 

CROSS-RERERENCE TO RELATED APPLICATIONS 

The present application claims the benefit of priority to PCT application 
PCT/US00/41005, filed September 27, 2000, which claims the benefit of priority 
under 35 U.S.C. §11 9(e) of U\S. Provisional Patent Application Number 
60/156,153, filed September 2(, 1999, both of which are incorporated herein by 
reference. 
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The present invention relates to the identification of polynucleotide 
sequences that are differentially expressed in bladder cancer. More specifically, 
the present invention relates to the use of the sequences and gene products for 
diagnosis and as probes. 
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Bladder cancer is the second most-common genitourinary cancer 
in the United States, with only prostate cancer being more frequently diagnosed. 
Bladder cancer accounts for approximately two percent of all malignant tumors 
and approximately seven percent of all urinary tract malignancies in U.S. men. 
Over 54,000 new cases were estimated to be diagnosed in the United States in 
1998, with approximately 12,500 deaths predicted [American Cancer Society, 
1998]. The prevalence of bladder cancer is higher in industrialized nations, 
perhaps reflecting increased exposure to environmental carcinogens. Men are 
three times more frequently affected than women. The disease usually occurs 
between 60-70 years of age and the age-adjusted bladder cancer rate in white 
men is almost twice that of black men. Most bladder cancers (over 90%) are 
carcinomas of the transitional epithelium of the bladder's mucosal lining 
}:{ (transitional cell carcinoma (TCC)). Although 90 percent of the cases are 

25" localized at diagnosis, up to 80 percent recur. 
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A number of etiological factors are associated with the 
development of bladder cancer, but in industrialized countries, cigarette smoking 
is the most significant. Specific chemicals have also been identified as causing 
bladder cancer, as have a number of occupational exposures to less well-defined 
specific agents. Treatment with cytostatic drugs, especially cyclophosphamide, is 
associated with increased risk of bladder cancer, as is treatment with 
40 radiotherapy for uterine cancer. 
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Bladder cancer is a potentially preventable disease, with a 
significant morbidity and mortality in many parts of the world. 
Tumors are graded according to the degree of cellular abnormality, with the most 
atypical cells being designated as high-grade (i.e., G3 grade) tumors. The major 
prognostic factors in carcinoma of the bladder are the depth of invasion into the 
bladder wall and the degree of differentiation of the tumor. The higher the grade 
of the tumor at the diagnosis, the higher the incidence of death from the disease 
within two years. 

The stage of development of the tumor is significant in estimating 
disease prognosis. Most superficial, non-invasive tumors are papillary tumors 
which do not invade the lamina propria, and are classified as non-invasive TCC, 
i.e., "Ta" tumors, and they can recur, but nearly 70% will not progress further. A 
tumor which does not invade the muscle but does enter the lamina propria- 
presents in many cases a worse prognosis. Such tumors are also classified as 
non-invasive TCC but are termed T1 tumors. Most superficial tumors are well 
differentiated and classified as G1 grade tumors. Patients in whom superficial 
tumors are less differentiated, large, multiple, or associated with carcinoma in 
situ in other areas of the bladder mucosa, (classified as G2-G3 tumors) are at 
greatest risk for recurrence and the development of invasive cancer. Invasive 
bladder tumors tend to spread rapidly to the regional lymph nodes and then into 
adjacent structures. Overall, the five-year survival rate of TCC is 76 percent for 
whites and 55 percent for blacks. 



One of the management problems is the fact that carcinoma of the 
bladder is frequently multifocal. The entire bladder epithelium and the lining of 
the entire urothelial cell tract can undergo malignant change. After apparently 
successful treatment of a bladder lesion, new tumors can occur at the same site 
(recurrence) or in other urothelial cells in the bladder. Approximately 30 percent 
of bladder carcinomas present as multiple lesions at the time of initial diagnosis. 
The early diagnosis of bladder cancer is central to the effective treatment of 
TCC. Presently, the detection of bladder tumors relies on intravenous pyelogram 
or other contrast studies to rule out urothelial involvement in the kidneys or 
ureters, and invariably cystoscopy which remains the accepted standard for 
diagnosis of mucosal abnormalities. There are no presently reliable methods 
available to easily and specifically identify the presence of bladder cancer cells. 
A variety of new technologies and potential tumor markers are being studied in 
bladder cancer and some are being translated into clinical use. 

It is important to realize that all available results of the diagnostic 
value of tumor markers do not allow firm clinical recommendations, but tests 
based on biomarkers undoubtedly influence the management of bladder cancer 
in the near future. Several new markers have been already identified and even 
approved for use (e.g. bladder tumor antigen (BTA) markers, NMP22, FDP). 
However, their clinical use is limited [Grossman, 1998], due to sensitivity and 
specificity problems in conjunction with cystoscopic examination. 



Furthermore, due to the high rate of disease recurrence, follow-up 
of TCC patients is obligatory. There is a need to eliminate the invasive cytoscopy 
method of diagnosis and of follow-up and replace it with a reliable and non- 
invasive method of diagnosis. 

Approximately 70-80 percent of patients with newly diagnosed 
bladder cancer present with superficial, non-invasive bladder tumors. Those who 
do are often curable. Tumor patients with deeply invasive disease can 
sometimes be cured by complete surgical removal of the bladder, irradiation, or a 
combination of modalities that include chemotherapy, however the five-year 
survival rate is less likely for such tumors. It is therefore of major importance to 
detect new tools that aid in both the initial early diagnosis and in follow-up of 
non-invasive TCC tumors. 

Adverse prognostic features associated with a greater risk of 
disease progression include the presence of multiple aneuploid cell lines, nuclear 
p53 overexpression, and expression of the Lewis-x blood group antigen [Hudson 
and Herr, 1995; Lacombe et at., 1996]. It has been postulated that p53 can be 
useful for predicting the level of aggression of the tumor and to identify patients 
who will not benefit from chemotherapy. However, only a very small, select group 
of patients with invasive disease can benefit from this approach [Ozen, 1998]. 
Several treatment methods (i.e., transurethral surgery, intravesical medications, 
and cystectomy) have been used in the management of patients with superficial 



tumors, and each method can be associated with five-year survival in 55-80 
percent of patients treated [Hudson and Herr, 1995; Torti and Lum, 1984]. 
Invasive tumors that are confined to the bladder muscle on pathologic staging 
after radical cystectomy are associated with an approximately 75 percent, five- 
year progression-free rate of survival. Patients with more deeply invasive tumors 
(which are also usually less well differentiated) experience five-year survival 
rates of 20-40 percent following radical cystectomy. When the patient presents 
with a locally extensive tumor that invades pelvic viscera or with metastases to 
lymph nodes or distant sites, a five-year survival rate is uncommon, but 
considerable symptomatic palliation can still be achieved. 

Surgery is the main treatment method. The extent of surgery is 
dependent on the pathological stage of the disease. Early disease is generally 
treated by intravesical chemotherapy and transurethral resection. Locally 
invasive disease can usually be managed only by radical cystectomy and urinary 
diversion. Definitive (curative) radiotherapy is generally reserved for bladder 
cancer patients who are not candidates for surgery. For superficial, low-grade 
disease, chemotherapy is applied intravesical^ (directly into the bladder) to 
concentrate the drug at the tumor site and eliminate any residual tumor mass 
after resection. Systemic chemotherapy can also be used to manage advanced 
bladder cancer; compete response rates of 30-50 percent have been reported. 
Single agent chemotherapy has demonstrated limited success. 



However, even following surgery and resection of non-invasive 
TCC tumors, frequent follow-up is required (every 3 months) in both non-invasive 
and invasive cases. 

It would therefore be useful to be able to identify early stage TCC in 
bladder cancer which has a significantly higher cure rate and generally does not 
require surgery. In addition, it would be useful to identify markers that can be 
employed for early diagnosis and follow-up of both non-invasive and invasive 
TCC, as an efficient and non-invasive alternative to cytoscopy. 

SUMMARY OF THE INVENTION 

According to the present invention, there is provided a method of 
diagnosing the presence of bladder cancer in a patient by analyzing a patient- 
derived sample for the presence of a least one expressed gene wherein the high 
level of expression of the gene is indicative of bladder cancer. Also provided by 
the present invention is a polynucleotide sequence whose expression is 
indicative of bladder cancer. A marker for bladder cancer is also provided. 
There are also provided methods of diagnosing bladder cancer by screening for 
the presence of at least one expressed gene wherein the presence of the 
expressed gene is indicative of bladder cancer. Methods of treating and 
regulating bladder cancer-associated pathology by administering to a patient a 
therapeutically effective amount of a chemical compound which inhibits a gene 



comprising the nucleic acids sequences of the present invention are also 
provided. 

DESCRIPTION OF THE INVENTION 

According to the present invention, purified, isolated and cloned 
nucleic acid sequences associated with bladder cancer are provided. More 
specifically, the polynucleotides of the present invention are described in 
Tablesi, 2, and 5 and the corresponding sequences are set forth in Tables 3, 4 
and 6 respectively. 

When referring to bladder cancer, both invasive and noninvasive 
forms are included. Bladder cancers can also be referred to as transitional cell 
carcinomas or "TCC". 

- The present invention further provides a method of diagnosing the 

presence of bladder cancer in a patient, including the steps of analyzing a tissue 
sample from the patient for the presence of at least one expressed gene (up- 
regulated) wherein the mRNA from the expressed gene hybridizes to at least one 
of the sequences in Tables 1 or 2, with hybridization occurring under conditions 
sufficiently stringent to require at least 95% base pairing. 

Further the present invention provides antibodies directed against 
the gene products of the sequences of the present invention. The antibodies can 
be either monoclonal, polyclonal or recombinant and be used in immunoassays 
as described in the Methods herein below. 
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By regulate or modulate or control is meant that the process is 
either induced or inhibited to the degree necessary to effect a change in the 
process and the associated disease state in the patient Whether induction or 
inhibition is being contemplated is apparent from the process and disease being 
treated and is known to those skilled in the medical arts. The present invention 
identifies genes for gene therapy, diagnostics and therapeutics that have direct 
causal relationships between a disease and its related pathologies and up- or 
down-regulator (responder) genes. That is, the present invention is initiated by a 
physiological relationship between cause and effect. 

The present invention identifies polynucleotides named in Tables 1 
and 2, and set forth in Tables 3 and 4 respectively,- that can be utilized 
diagnostically in bladder cancer. Polynucleotides named in Table 1 were found to 
match sequences in data banks and were newly found in the present application 
to be upregulated in TCC. The polynucleotides named in Table 2 are either 
genes with unknown protein product or of unknown genes. All the 
polynucleotides named in both Tables 1 and 2 were found to be associated with 
TCC relative to normal bladder samples. The polynucleotides named in Table 5 
have their corresponding sequences set forth in Table 6 t some of which are 
novel. 

Where the sequences are partial sequences, they are markers or 
probes for genes that are regulated in bladder carcinoma. By "regulated" it is 
meant that the genes can be either upregulated or downregulated, depending 
upon the specific gene. In general these partial sequences are designated 



"Expressed Sequence Tags" (ESTs) and are markers for the genes actually 
expressed in vivo and are ascertained as described herein. Generally, ESTs 
comprise DNA sequences corresponding to a portion of nuclear encoded mRNA. 
The EST has a length that allows for PCR (polymerase chain reaction), use as a 
hybridization probe and is a unique designation for the gene with which it 
hybridizes (generally under conditions sufficiently stringent to require at least 
95% base pairing). 

For a detailed description and review of ESTs and their functional 
utility see WO 93/00353 which is incorporated in its entirety by reference. WO 
93/00353 further describes how the EST sequences can be used to identify the 
transcribed genes. The Example herein also describes a method of identification. 
The present invention also provides a method of diagnosing the presence of 
bladder cancer in a patient, by the expression of at least one expressed gene 
(up-regulated) identified by the polynucleotides of the present invention set forth 
in Tables 1-6. Methods of identification of hybridization results can include, but 
are not limited to, immunohistochemical staining of the tissue samples. Further 
for identification of the gene, in situ hybridization, Southern blotting, single strand 
conformational polymorphism (SSCP), restriction endonuclease fingerprinting 
(REF), PCR amplification and DNA-chip analysis using nucleic acid sequence of 
the present invention as probes/primers can be used. 

The present invention further provides proteins encoded by the 
identified genes. The present invention further provides antibodies directed 
against these proteins. The present invention further provides transgenic animals 



and cell lines carrying at least one expressible gene identified by the present 
invention. The present invention further provides knock-out eukaryotic organisms 
in which at least one nucleic acid sequences as identified by the probes of the 
present invention and prepared as described in the Methods. 

The present invention provides a method of diagnosing bladder 
cancer, in particular TCC, in a subject which comprises determining in a sample 
from the subject the level of expression of at least one polypeptide -encoding 
polynucleotide, wherein a higher level of expression of the polynucleotide 
compared to the level of expression of the polynucleotide in a subject free of 
bladder cancer is indicative of bladder cancer, and wherein the polypeptide - 
encoding polynucleotide comprises a polynucleotide selected from the group 
consisting of 

(a) the polynucleotides listed in Tables 3, 4 and 6; 

(b) polynucleotides having sequences that differ from the 
polynucleotides in (a), without changing the polypeptide encoded thereby; and 
(c) polynucleotides which are at least 70% homologous to the polynucleotides of 
(a). 

In a preferred method of the invention, the analyzing step 
comprises using mRNA from the expressed gene to hybridize to at least one of 
the sequences in Tables 3, 4 and 6. In other preferred methods of the invention, 
the analyzing step comprises using RT-PCR technology or using a specific 
antibody to detect the presence of a polypeptide encoded by said polynucleotide. 
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The present invention also provides a method of diagnosing of diagnosing stage 
Ta or stage T1 in TCC, which comprises determining in a sample from the 
patient the level of expression of at least one polypeptide -encoding 
polynucleotide, wherein a higher level of expression of the polynucleotide 
compared to the level of expression of the polynucleotide in a patient free of 
bladder cancer is indicative of stage Ta or stage T1 , and wherein the polypeptide 
-encoding polynucleotide comprises a polynucleotide selected from the group 
consisting of 

(a) the polynucleotides listed in Tables 3, 4 and 6; 

(b) polynucleotides having sequences that differ from the 
polynucleotides in (a), without changing the polypeptide encoded thereby; and 
(c) polynucleotides which are at least 70% homologous to the polynucleotides of 

(a). 

The present invention also provides isolated polynucleotides which 
comprise a polynucleotide selected from the group consisting of: 

(a) the novel polynucleotides listed in Tables 4 and 6; 

(b) polynucleotides having sequences that differ from the 
polynucleotides in (a), without changing the polypeptide encoded thereby; and 

(c) polynucleotides which are at least 70% homologous to the 

polynucleotides of (a). 

The present invention also provides such polynucleotides wherein 
the polynucleotide comprises a polynucleotide having at least 30, preferably at 
least 40, nucleotides from the polynucleotides described above. 



The present invention also provides compositions comprising the 
isolated polynucleotides of the invention. 

The present invention also provides an isolated polypeptide 
encoded by a polynucleotide, wherein the polynucleotide comprises a 
polynucleotide selected from the group consisting of: 

(a) the polynucleotides listed in Tables 3, 4 and 6; 

(b) polynucleotides having sequences that differ from the 
polynucleotides in (a), without changing the polypeptide encoded thereby; and 

(c) polynucleotides which are at least 70% homologous to the 

polynucleotides of (a). 

The present invention also provides such a polypeptide, wherein 
the polypeptide is a portion which retains the biologicalactivity thereof or a 
polypeptide which is at least substantially homologous or identical thereto. 

The present invention also provides a peptide, whereirvthe peptide 
is.dominant negative peptide which competes with the biological activity of the 
polypeptide. 

The present invention also provides an antibody which binds to a 
unique epitope of the polypeptide of the invention. The present invention also 
provides a method of diagnosing bladder cancer in a patient which comprises 
determining in a sample from the patient the level of expression of at least one 
polypeptide, wherein a higher level of polypeptide compared to the level of the 
polypeptide in a patient free of bladder cancer is indicative of bladder cancer. 
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The method includes using an antibody, preferably wherein the presence .of 
more than one polypeptide is detected by using more than one such antibody. 
The present invention also provides a method of treating bladder cancer- 
associated pathology in a subject by administering to the subject a 
therapeutically effective amount of a chemical compound which inhibits a gene, 
or polypeptide encoded thereby, which comprises a polynucleotide selected from 

the group consisting of: 

(a) the polynucleotides listed in Tables 3, 4 and 6; 

(b) polynucleotides having sequences that differ from the 
polynucleotides in (a), without changing the polypeptide encoded thereby; and 

(c) polynucleotides which are at least 70% homologous to the 

polynucleotides of (a). 

The present invention also provides a gene therapy vehicle for 
delivering a polynucleotide of the invention to a subject, whereby the 
polynucleotide is expressed-in the target cells of the subject. The present - 
invention also provides isolated antisense oligonucleotides complementary to 
the polynucleotides of the invention . 

The samples from the subjects which are used for diagnosis 
comprise samples of urine, blood, saliva, tissues and cells of all types; urine 
samples are preferred. A control sample includes a normal equivalent sample 
derived from a healthy subject. 

The term "antibody" includes polyclonal antibody, single chain 
antibody , Fab fragment, monoclonal (MAB), polyclonal and recombinant 
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antibodies. A molecule which comprises the antigen-binding portion (CDR) of an 
antibody specific for a polypeptide, variant or fragment is also included in the 

term "antibody". 

Negative dominant peptide refers to a partial cDNA sequence that 
encodes for a part of a protein, i.e. a peptide (see Herskowitz, 1987). This 
peptide can have a different function from the protein from which it was derived. 
It can interact with the full protein and inhibit its activity or it can interact with 
other proteins and inhibit their activity in response to the full protein. Negative 
dominant means that the peptide is able to overcome the natural proteins and 
fully inhibit their activity to give the cell a different characteristic, like resistance 
or sensitization to killing. For therapeutic intervention either the peptide itself is 
delivered as the active ingredient of a pharmaceutical composition or the cDNA 
can be delivered to the cell utilizing the same methods as for antisense delivery. 

The antagonist or regulating agent or active ingredient is dosed and 
delivered in a pharmaceutical^ acceptable carrier as described herein below. 
The term antagonist or antagonizing is used in its broadest sense. Antagonism 
can include any mechanism or treatment which results in inhibition, inactivation, 
blocking or reduction in gene activity or gene product and for example preventing 
progression from non-invasive to invasive. The antagonizing step can include 
blocking cellular receptors for the gene products and can include antisense 
treatment as discussed herein. 

Many reviews have covered the main aspects of antisense (AS) 
technology and its enormous therapeutic potential (Wright and Anazodo, 1995). 
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There are reviews on the chemical (Crooke, 1995; Uhlmann et al. 1990). cellular 
(Wagner, 1994) and therapeutic (Hanania, et al, 1995; Scanlon, ef a/, 1995; 
Gewirtz, 1 993) aspects of this rapidly developing technology. Antisense 
intervention in the expression of specific genes can be achieved by the use of 
synthetic AS oligonucleotide sequences (for recent reports see Lefebvre- 
d'Hellencourt et al, 1995; Agrawal, 1996; Lev-Lehman et al, 1997). AS 
oligonucleotide sequences can be short sequences of DNA, typically 15-30 mer 
but can be as small as 7 mer (Wagner et al. 1996), designed to complement a 
target mRNA of interest and form an RNA:AS duplex. (See also Calabretta et al, 
1996). Phosphorothioate antisense oligonucleotides do not normally show 
significant toxicity at concentrations that are effective, exhibit sufficient 
pharmacodynamic half-lives in animals (Agarwal et al., 1996) and are nuclease 
resistant. Instead of an antisense sequences as discussed herein above, 
ribozymes can be utilized. This is particularly necessary in cases where. 
. antisense therapy is limited by stoichiometric considerations (Sarver et al., 1990, 
Gene Regulation and Aids, pp. 305-325).. (See also Hampel and Tritz, 1989; 

Uhlenbeck, 1987). 

Ribozymes catalyze the phosphodiester bond cleavage of RNA. 

Several ribozyme structural families have been identified including Group I 

introns, RNase P, the hepatitis delta virus ribozyme, hammerhead ribozymes 

and the hairpin ribozyme (Sullivan, 1994; U.S. Patent No. 5,225,347, columns 4- 

5). Modifications or analogues of nucleotides can be introduced to improve the 
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therapeutic properties of the nucleotides. Improved properties include increased 
nuclease resistance and/or increased ability to permeate cell membranes. 
Nuclease resistance, where needed, is provided by any method known in the art 
that does not interfere with biological activity of the antisense oligodeoxy- 
nucleotides, cDNA and/or ribozymes as needed for the method of use and 
delivery (Iyer et al.. 1990; Eckstein, 1985; Spitzer and Eckstein, 1988; Woolf et 
al.. 1990; Shaw et al., 1991). Modifications that can be made to oligonucleotides 
in order to enhance nuclease resistance include, but are not limited to, modifying 
the phophorous or oxygen heteroatom in the phosphate backbone. These 
modifications also include preparing methyl phosphonates, phosphorothioates, 
phosphorodithioates and morpholino oligomers. 

The present invention also includes all analogues of, or 
modifications to, an oligonucleotide or polynucleotide of the invention that does 
not substantially affect the function of the oligonucleotide. The nucleotides can 
be selected from naturally occurring or synthetic modified bases. Naturally 
occurring bases include adenine, guanine, cytosine, thymine and uracil. Modified 
bases of the oligonucleotides include xanthine, hypoxanthine, 2-aminoadenine, 
6-methyl, 2-propyl and other alkyl adenines, 5-halo uracil, 5-halo cytosine, 6-aza 
cytosine and 6-aza thymine, psuedo uracil, 4-thiuracil, 8-halo adenine, 8- 
aminoadenine, 8-thiol adenine, 8-thiolalkyl adenines, 8-hydroxyl adenine and 
other 8-substituted adenines, 8-halo guanines, 8-amino guanine, 8-thiol guanine, 
8-thioalkyl guanines, 8-hydroxyl guanine and other substituted guanines, other 
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aza and deaza adenines, Cher aza and deaza guanines. S-.rifluorome.hyl uracil 

and 5-trifluoro cytosine. 

In addition, analogues of nucleotides and/or polynucleotides can be 

prepared wherein the structure of the nucleotide and/or polynucleotide is 
fundamentally altered and that are better suited as therapeutic or experimental 
reagents. An example of a nucleotide analogue is a peptide nucleic acid (PNA) 
wherein the deoxyribose (or ribose) phosphate backbone in DNA (or RNA) is 
replaced with a polyamide backbone which is similar to that found in peptides. 
PNA analogues have been shown to be resistant to degradation by enzymes and 
,o have extended lives in vivo and in vitro. Further, PNAs have been shown to 
bind stronger to a complementary DNA sequence than a DNA molecule. This 
observation is attributed to the lack of charge repulsion between the PNA strand 
and the DNA strand. Other modfflcations that can be made to oligonucleotides 
■ include polymer backbones, cyclic backbones, or acyclic backbones. 

The active ingredients of pharmaceutical compositions can include 
oligonucleotides that are nuclease resistant as are needed for the practice of the. 
invention or a fragment thereof shown to have the same effect when targeted 
against the appropriate sequence(s) and/or ribozymes. Combinations of active 
ingredients as disclosed in the present invention can be used, including 
combinations of antisense sequences. 

The antisense oligonucleotides (and/or ribozymes) and cDNA of 
the present invention can be synthesized by any method known in the art for 
ribonucleic or deoxyribonucleic nucleotides. For example, an Applied Biosystems 



18 



380B DNA synthesizer can be used. When fragments are used, two or more 
such sequences can be synthesized and linked together for use in the present 

invention. 

. The nucleotide sequences of the present invention can be 
delivered either directly or with viral or non-viral vectors. When delivered directly 
the sequences are general.y rendered nuclease resistant. Mernative.y the 
sequences can be incorporated into expression cassettes or constructs such that 
the sequence is expressed in the cell as discussed herein below. GeneraHy the 
construct contains the proper regulatory sequence or promoter to allow the 
sequence to be expressed in the targeted cell. 

The proteins of the present invention can be produced 
recombinants (see generally Marshak et ... 1996 "Strategies for Protein 
Purification and Characterization. A laboratory course manual.". CSHL Press) 
and analogues can be due to post-translational processing. 
More in particular, with respect to polynucleotides disclosed herein, and 
corresponding polypeptides expressed from them, the invention further 
comprehends isolated and/or purified polynucleotides (nucleic acid molecules) 
and isolated and/or purified polypeptides having at least about 70%. preferably 
at least about 75% homology . more preferab«y at least about 80% , even more 
preferably at least about 90% . most preferably at least about 95% homology to 
the polynucleotides and polypeptides disclosed herein. 

Nucleotide sequence homology can be determined using the 
"Align" program of Myers and Miller. ((1988) CAB.OS 4:11-17) and available at 
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NCBI. Alternatively or additionally, the term "homology" ." for instance, with 
respect to a nucleotide or amino acid sequence, can indicate a quantitative 
measure of homology between two sequences. The percent sequence 
homology can be calculated as (Nref - Ndif)-100/Nref , wherein Ndif is the total 
number of non-identical residues in the two sequences when aligned and 
wherein Nref is the number of residues in one of the sequences Hence, the 
DNA sequence AGTCAGTC has a sequence similarity o, 75% to AATCAATC 

(Nref = 8; Ndif = 2). 

Alternatively or additionally, "homology with respect to sequences 

can refer to the number of posrtions with identical nucleotides or amino acid 

residues divided by the number of nucleotides or amino add residues in the 

shorter of the two sequences wherein alignment of the two sequences can be 

determined in accordance with the Wilbur and Lipman algorithm ((1 983) Proc. 

Natl. Acad. Sci. USA 80:726), for instance, using a window size of 20 

nucleotides, a word length of 4 nucleotides, and a gap penalty of 4, and 

computer-assisted analysis and Interpretation of the sequenced including 

alignment can be conveniently performed using commercially available programs 

(e.g„ intelligences™ Suite. Intelligent Inc.. CA). When RNA sequences are 

said to be similar, or to have a degree of sequence identity or homology with 

DNA sequences, thymidine (T) in the DNA sequence is considered equal to 

uracil (U) in the RNA sequence . RNA sequences within the scope of the 

invention can be derived from DNA sequences or their complements, by 

substituting thymidine (T) in the DNA sequence with uracil (U). 
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Additionally or alternatively, amino acid sequence similarity or 
identity or homology can be determined, for instance, using the BlastP program 
(Altschul et al. Nucl. Acids Res. 25:3389-3402) and available at NCBI. The 
following references provide algorithms for comparing the relative percentage 
homology of amino acid residues of two proteins, and additionally, or 
alternatively, with respect to the foregoing, the teachings in these references can 
be used for determining percent homology: Smith et al. (1981) Adv. Appl. Math. 
2:482-489; Smith et al. (1983) Nucl. Acids Res. 11:2205-2220; Devereux et al. 
(1984) Nucl. Acids Res. 12:387-395; Feng et al. (1987) J. Molec. Evol. 25:351- 
360; Higgins et al. (1989) CABJOS 5:151-153; and Thompson et al. (1994) Nucl. 

Acids Res. 22:4673-480. 

Polynucleotide sequences that are complementary to any of the 
sequences or fragments encompassed by the present invention discussed above 
are.also'considered to be part of the present invention. Whenever any of the 
.sequences discussed above are produced in a cell, the complementary 

sequence is concomitantly produced and, thus, the complementary sequence 
can also be used as a probe for the same diagnostic purposes. 
"Functionally relevant" refers to the biological property of the molecule and in this 
context means an in vivo effector or antigenic function or activity that is directly 
or indirectly performed by a naturally occurring protein or nucleic acid molecule. 
Effector functions include, but are not limited to include, receptor binding, any 
enzymatic activity or enzyme modulatory activity, any carrier binding activity, any 
hormonal activity, any activity in promoting or inhibiting adhesion of cells to 
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extracellular matrix or cell surface molecules, or any structural role as well, as 
having the nucleic acid sequence encode functional protein and can be 
expressible. The antigenic functions essentially meanthe possession of an 
epitope or antigenic site that is capable of cross-reacting with antibodies raised 
against a naturally occurring protein. Biologically active analogues share an 
effector function of the native which can, but do not necessarily, additionally 

possess an antigenic function. 

The above discussion provides a factual basis for the use of the 
sequences of the present invention to identify bladder cancer-associated genes 
and provide diagnostic probes and markers to identify bladder cancer, 
particularly in the early stages of TCC. 

EXAMPLES 

EXAMPLE 1 

MFTHODS OF THF INVENTION 

A detailed description of the methods employed in the present 
invention is set forth in co-assigned US patent application USSN 09/534,661 
filed on March 24, 2000, corresponding to PCT patent publication number WO 
00/56935 and incorporated herein by reference in its entirety. The method 
includes preparing cell fractionations; extracting intact total RNA from membrane 
bound polysomes and free polysomes; preparing cDNA probes from template 
RNA derived from the extracted polysomes; performing microarray-based 
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comparison o« the relative abundance o. the different RNA species, analyzing 
the resu,ts; thereby identtfying genes or Cones encoding speedy membranai 

or secreted proteins. 

Identification of cDNAs and genes encoding secreted or 
membrana, encoding mRNAs is of major importance in TCC. More specify, 
novel genes which mark the eady stages of TCC and code for secreted prote.ns 
ar e *e uitimate markers for diagnosis and fo,,ow-up o, TCC. By deriving probes 
from template RNA extracted from membrane-bound polysomes and free 
polysomes and performing microarray-based comparison of the relative 
abundance of drfferen. RNA species, such po.en.iaHy secreted proteins can be 
identify. Analysis of the results of such comparison and identification of me 
ctones encoding for membrana, or secreted pro.eins provides a valuable tool 
which can be used together with Cher gene discover tools, and which in Kself 
enables identification of likely targets for drug development. 

Since membranal and secreted proteins are both accessible and 
enseal for transduction of numerous intra- and intercellular signals, they are 
generally viewed as preferred targets for pharmacological use and intention. 
Therefore, me a prion classification of arrayed unknown gene sequences into 
those mat potentially code for secreted and membrana, proteins is of great value 
forme optimization of a high-throughput process of identifying potentia, drug 
targets. Furthermore, the identification of genes which express membrana, or 
secreted proteins that are differently expressed in different cel,ular situations . 
of the utmost importance in designing therapeutic or diagnostic tools for TCC. 
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A method of identifying Cones which encode membrane, and seeded prote.ns 
was employed b , preparing bladder cancer cell fractionations, preparing cDNA 
prob es from template RNA derived from membrane-bound polysomes and free- 
porysomes, performing a microarray-based comparison of the relate 
abundance of different RNA species, analyzing the result, and thereby 
identifying genes encoding for membrana, and secreted proteins. Since 

| pharmacological intervene, the present invention thus provides a method o, 

% identifying likely targets for TCC diagnosis and therapy. 

tn uvRRiniZATIO M * Mn PROBES: 

PJ Trr. and normn l hvhrirlization: 

* The probes were prepared from normal healthy bladder samples 

I and from TCC tumors. Only intact RNA with a proper histological report 

J indicating the existence of TCC was used. All normal and tumor material was . 

coilected from two separate clinical centers. Such approach minimizes the 
influence o, local specie surgical bias or subject o, the pathological report. 
Forty-one hybridizes were performed. In each hybridization, two probes were 
used simultaneously, each labeled wtth either Cy3 or Cy5. 

These probes were as follows: Probe 1 Probe 1 was common to 
al, hybridizations (common control probe,. RNA from TCC samples was mixed 

was labeled with Cy3 and used in all hybridizations; and Probe 2. In each o, the 
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hybridUations, a different RNA sample from a single donor was used (test 

probe). 

A common control for all the hybridizations enables comparison of 
the results between the different hybridizations. If the common control (probe 1) 
hybridization results are similar in pattern in different hybridizations, comparison 
can be made between the resuKs of probe 2 hybridizations and al, hybridizations. 
Seventeen hybridizations included 16 RNA samples extracted from different 
control heaKhy bladder mucosa labe,ed with C y 5. Twenty-three hybridizes 
W ere performed with RNA samples derived from tumor tissues, either from non- 
invasive Ta or from T1 stages of development. Two hybridizations were 
performed with RNA extracted from 2 invasive TCC samples. 

The hybridizations were carried out in three separate sets, but the 
same common control was used in al, sets. Set 1 includes hybridizations 2-1 1 
(TC2-TC1 1), set 2 includes hybridizations. 16-25 (TC16-TC25). and set 3 
includes hybridizations 28-41 <TC28-TC41.)..By using three different sets of 
hybrid*a,ions, the possibly of technical effects reiated to specif, hybridizations 
is reduced. See Tables below and related description. 



" TCC cell line -T24- (from ATCC) was used for cellular fractionation. 

Membrane-bound polysomes were separated from free polysomes using a 
sucrose step gradient. RNA coding for potentially secreted proteins was isolated 
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from this microsomal-membrana. fraction and separated from RNA coding for 
intracellular proteins. Hybridization was performed as described hereunder. 
The probes used were as follows: Probe 1 . Free polysoma. RNA fraction labeled 
with Cy3; and Probe 2. Membrane-bound RNA fraction with Cy5. 

TCC CHIP PRFPARATION 

All hybridizations were performed on TCC designated microarray. 
The microarray was made up of cDNA clones derived from 3 different libraries: 
SDGI library: (Described in co-assigned US Patent Application USSN 
09/538,709, filed 30 March. 2000, corresponding to PCT application filed March. 
2001 and incorporated herein by reference in its entirety): A pool of non-invasive 
TCC, invasive TCC and normal bladder was used for library preparation. 4550 
clones from the SDGI library were included in the TCC chip. 
Antisense library: (Described in co-assigned US Provisional Patent Application 
SN 60/157,843 . filed 6 October. 1999. corresponding to PCT application 
PCT/US00/27557. filed 6 October, 2000, and incorporated herein by reference in 
its entirety): The same cDNA pool used for the SDGI library was used for the 
preparation of a library enriched for antisense sequences. 450 clones from this 

library were included in the TCC chip. 

SSH library: (Diatchenko et al., 1996). A subtraction library was 
made as follows. A normal bladder RNA pool was used for subtraction from non- 
invasive TCC RNA pool. The subtracted cDNA was used for the microarray 
printing. 5000 clones from the SSH library were used for printing. 
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rc c n«.ral method i" llar bioloOV. 

Standard molecular biology techniques known in the art and not 
specifically described were generally followed as in Sambrook et a,., Motocuto 
Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York 
(1989). and in Ausubel et al., Current Protocol in Molecular Biology, John Wiley 
and Sons, Balfimore, Maryland (1989) and in Perbal, A Practice, Guide to 
Molecular Cloning. John Wiley & Sons, New York (1988), and in Watson et a,.. 
Recombinant DNA, Scientific American Books, New York and in Birren e. a, 
( eds) Genome Analysis: A Laboratory Manual Senes. Vols. 1-4 Cold Spring 
Harbor Laboratory Press, New York (1998) and methodology as set forth in 
United States patents 4,666,828; 4,683.202; 4.80L531; 5.192.659 and 
5,272.057 and incorporated herein by reference. Polymerase chain reaction 
(PCR) was carried out generally as in PCR Protocols: A Guide To Methods And 
Ap p«calions. Academic Press, San Diego, CA (1990). In-situ (In-cell) PCR in. . 
combination with Flow Cytometry can be used for detection of cells containing 
specific DNA and mRNA sequences (Testoni et al, 1996, Blood 87;3822.) 
Genera, methods in immunology; Standard methods in immunology known in the 
art and not specifically described are generally followed as in Stites et al.(eds), 
Basic and Clinical Immunology (8th Edition), Appleton & Lange, Non»alk, CT 
(1994) and Mishell and Shiigi (eds), Selected Methods in Cellular Immunology, 
W.H. Freeman and Co., New York (1980). 
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immunoassays 

In general, ELlSAs where appropriate are one of the 
immunoassays employed to assess a specimen. EL.SA assays are we., Known 
t0 those skilled in the art. Both polyclona. and monoclonal antibodies can be 
used in the assays. Where appropriate other immunoassays, such as 
radioimmunoassays (R.A) can be used as are known to those in the art. 
Ava „ab.e immunoassays are extensively described in the patent and science 
literature. See, for example, United States patents 3,791,932; 3,839,153; 
3 850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901 ,654; 3,935,074; 
3 984 533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 
5,281 ,521 as well as Sambrook et a«, Molecular Cloning: A Laboratory Manual, 
Cold Springs Harbor, New York, 1989 

Antibody Production 

Antibodies can be either monoclonal, polyclonal or recombinant. 
Conveniently, the antibodies can be prepared against the immunogen or portion 
thereof for example a synthetic peptide based on the sequence, or prepared 
recombinant* by cloning techniques or the nature, gene product and/or portions 
thereof can be isolated and used as the immunogen. Immunogens can be used 
,o produce antibodies by standard antibody production technology we,, Known to 
those s«ed in the art as described genera,,y in Hartow and Lane, Annies: A 
Laboratory Manual Cold Spring Harbor Labored, Cold Spring Harbor, NY, 
198 8 and Borrebaeck, /Woody Sneering - A Practical Guide. W.H. Freeman 
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and Co 1 992. Antibody fragments can a,so be prepared from the antibodies 
and ind.de Fab, F(ab'), and Fv by methods Known to those skilled in the art. 
For producing poiyCona, antibodies a host, such as a rabbi, or goat, is 
immunized w*h the immunogen or immunogen fragment, generaily with an 
adjuvant and. if necessa*. coupled." a carrier; an fi bod*s to the immunogen 
are ejected from the sera. Further, the polyclonal antibody can be absorbed 
such that , is monospectfe. That is, the sera can be absorbed against reiated 
immunogens so that no cross-reactive antibodies remain in the sera rendering * 
monospecific. 

For producing monoclonal antibodies the technique involves 
nyperimmunization of an appropriate donor with the immunogen, generally a 
mouse, and isolation of splenic antibody producing cells. These cells are fused 
t0 a ce„ having immortality, such as a myeloma ce„, to provide a fused cell 
nybrid which has immortal* and secretes the required antibody. The cells are 
the n cultured, in b* and the monoclonal antibodies halted from the cu,,ure 
media for use. 

For producing recombinant antibody (see generally Huston et al, 
1W1- Johnson and Bird, 1991; Mernaugh and Mernaugh, 1995), messenger 
RNAs «rom anybody producing B-lymphocytes of animals, or hybridoma are 
reverse-transcribed to obtain complementary DMAs (CDNAs) . Antibody cDNA, 
which can be full or partial length, is amplified and cloned into a phage or a 
piasmid. The cDNA can be a partial length of heavy and light chain cDNA, 
separated or connected by a linKer, e.g., encoding a single chain antibody. The 
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antibody, or anybody fragment, is expressed using a sui.ab,e expression system 
to obtain recombinant antibody. Antibody cDNA can also be obtained by 
screening pertinent expression libraries. 

The antibody can be bound to a solid support substrate or 
coniugated «ith a detectable moiety or be both bound and conjugated as is we,, 
known in the art. (For a genera, discussion o, conjugation of fluorescent or 
enzymatic moieties see Johnstone & Thorpe, /rnmunoMry * F»** 
B,ackwe,l Scien«f,c Publicans, Oxford, 1982.) The binding o, antibodies to a 
solid support substrate is also we,, Known in the art. (see for a genera, discuss,on 
Hariow t Lane An**** A Moratory Menus,, Oo,d Spring Harbor Laboratory 
Pub,ica«ions, New York, 1988 and Borrebaeck, AntiW Engineering - A 
Practice, GuiCe. W.H. Freeman and Co., 1992, The detectable moieties 
contemplated with the present invention can include, but are no, limited to, 
fluorescent, metallic, enzymatic and radioactive markers such as biotin. go,d. 
f errifin. valine phosphatase, p-galactosidase, peroxidase, urease, fluorescein, 
rhodamine, tritium, ,4 C and iodination. 

p, rnmhinnrrt """"" Purification 
Marshak et al, "Strategies for Protein Purification and 
Characterisation. A laboratory course manual." CSHL Press, 1996. 



re*ne therapy. 
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The genes described in this paten, can also be used as targets for 
g ene therapy, s,nce these genes can be of importance for the development ot 
TCC Therefore, targeted gene therapy against one or more of these genes , or 
agai „s, one or more of the corresponding poiypeptides encoded by these genes, 

used herein refers to the transfer of genetic materia, (e.g. DNA or RNA) of 

phenotype. The genetic materia, of interest encodes a product (e.g. a protein, 
peptide, peptide, functiona, RNA, antlsense) whose production ,n *o ,s 
desired. For example, the genetic materia, of interest can encode a hormone, 
receptor, enzyme, poiypeptide or pepttfe of therapeutic value. A— y. the 

the text "Gene Therapy" (Advances in Pharmacol 40. Academic Press. 
1997). 

Vectors can be introduced intp cells or tissues by any. one of a 
variety of Known methods within the art. Such methods can be found genera*, 
described in SambrooK e, a,., Mofecutor « A Laboratory Manua,, Cold 
Springs Harbor Labors, New VorK ,1989. 1992), in Ausube, e, a,, Current 
Protocols in Mo/ecu/ar BUW. John Wiley and Sons, Baltimore, Maryland 
( 1989), Chang etal., Soma,* Gene rnerapy, CRC Press, Ann Arbo, Ml (1995). 
Vega et a,., Gene Taring. CRC Press, Ann Arbor, Ml (1995), Vectors: A 
S^^OW, Vector and Teases. Butterworths. Boston MA 
(1988) and Gilboa e. al (1986, and include, for exampie. stable or transient 
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nation. lipofection. electroporation and infecUon with recombinant v.ral 
actors in addition, see United States paten, 4.866,042 for vectors involving the 
centra, nervous system and aiso United States patents 5,464,764 and 5,487.992 
for positive-negative selection methods. 

introduction of nucleic acids by infection offers several advantages 
over the other listed methods. Higher efficiency can be obtained due to their 
inf eCious nature. Moreover, viruses are very specialized and typically infect and 
propagate in specific cell types. Thus, their natural specificity can be used to 
target the vectors to specie cei, types in vivo or wfthin a tissue or mixed culture 
of cells. VW vectors can a,so be modified with specific receptors or ligands to 
alter target specificity through receptor mediated events. 

A specific example of DNA viral vector for introducing and 
expressing recombinant sequences is the adenovirus derived vector 
Adenop53TK. This vector expresses a herpes virus thymidine kinase (TK) gene 
for eKher positrve or negate selection and an expression cassette for desired 
recombinant sequences. This vector can be used to infect cells that have an 
adenovirus receptor which includes most cancers of epithe,ial origin as wel, as 
others This vector as we,, as others that exhibit similar desired functions can be 
used to treat a mixed P opu,ation of ce„s and can include, for example, an in v,tro 
or ex vivo culture of cells, a tissue or a human subject. 

Additional features can be added to the vector to ensure its safety 
and/or enhance its therapetrtc efficacy. Such features include, for example, 
markers ma. can be used to negatively select against cells infected with the 
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gene described above that confers sensitive to the antibiotic gancyclow. 
Negafive seieotion is therefore a means by which infection can be control 
..cause « provides inducible suicide through the addition of antibiotic. Such 
pra ,ec,ion ensures tha, ». for exampte. mutations arise that produce aKered 
form s of the vira, vector or recombinant se„uence, ceiiuiar transformation can 
not occur. 

Features that limit expression to particular cell types can also be 
included. Such features include, for example, promoter and regulatory elements 

that are specific for the desired cell type. 

in addition, recombinant viral vectors are useful for in vivo 
expression of a desired nucleic acid because they offer advantages such as 
, a ,e,a, infection and targeting specificity. Utera, infection is inherent in the Me 

produces many progeny virions that bud off and in.ee, neighboring ce,,s. The 
r esu,t is that a ,arge area becomes rapid,, infected, most of which was not 
«a„y infected by the ohgina, vira, partiCe, This is in contrast to ve„ica,-type of 
infection in which the infects agent spreads on,y through daughter progeny. 
Vira, vectors can a,so be produced that are unable to spread ,atera„y. Th,s 

characteristic can be usefu, if the desired purpose is to introduce a specified 

gene into only a localized number of targeted cells. 

As described above, viruses are very specia,ized infectious agents 

tha t have evolved, in many cases, to elude host defense mechanisms. Typica,ty, 
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viruses infect and propagate in specific oe>, types, The targeting specificity o, 
viral vectors utiiizes * natural spec ffi city to specifically target predefined ce,i 
W es and thereby introduce a recombinant gene into the infected ce,i. The 

be treated then a vector specific for such epithelial cells are used. 

Retroviral vectors can be constructed to function either as 

infectious particles or to undergo only a single initia, round o, infection, 
former case, the genome of ,he virus is modified so that * maintains al, 
necessary genes, regulator sequences and pacKaging signals to synthesize 
new viral proteins and RNA. Once these molecules are synthesized, the host cel, . 
packages the RNA into new vira, particles which are capable of undergoing ■ - 
furt ner rounds of infection. The vectors genome is also engineered to encode 
and express the desired recombinant gene. In the case of non-infectious v,.a, 
vectors, the vector genome is usu* mutated to destroy the vira, pacKaging 
signal ma, is required to encapsulate the RNA into viral particles. Without such a 
signal, any particles that are formed do not contain a genome and therefore 
cannot proceed through subsequent rounds o, infection. The specfflc type of 
vector depends upon the intended application. The actual vectors are also 
Known and readily available within the art or can be constructed by one sKILed in 

the art using well-known methodology. 

The recombinant vector can be administered in several ways. If 
viral vectors are used, for example, the procedure can take advantage of their 
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,a,ge. specificity and consequently, do no, have to be administered locally a. the 
diseased site. However, local administration can provide a quicker and more 
effective treatment, administration can also be performed by. for example, 
intravenous or subcutaneous injection into the subject. Following injection, the 
viral vectors circulate until they recognize host cells wr.h the appropriate target 

specificity for infection. 

An alternate mode of administration can be by direct inoculation 
into the bladder i.e., locally to the ste of the disease or pathological condition or 
by inoculation into the vascular system supplying the site with nutrients. Local 
administration is advantageous because there is no dilution effect and, therefore. 
., a smaller dose is required to achieve expression in a majority of the targeted 
cells. Additionally, local inoculation can alleviate the targeting requirement 
required with other forms of administration since a vector can be used that 
infects all cells in the inoculated area. If expression is desired in only a specific 
subset of cells within the inoculated area, then promoter and regulatory elements 
that are specific for the desired subset can be used to accomplish this goal. 
Such non-targeting vectors can be, for example, viral vectors, viral genome, 
piasmids. phagemids and the like. Transfection vehicles such as liposomes can 
also be used to introduce the non-viral vectors described above into recipient 
cells within the inoculated area. Such transfec*on vehicles are.known by one 
skilled within the art. 



Chemical compounds 
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The chemical compounds to be administered comprise jnteralia 
small chemica, molecules; antibodies of al, types or fragments thereof including 
single chain antibodies; antisense oligonucleotides, antisense oligonucleotides, 
polynucleotides-, DNA or RNA molecules; proteins, polypeptides and peptides 
including peptido-mimetics and dominant negative peptides; ribozymes ; and 
expression vectors 

n»li,mr y nf rhemica ' mmOQUnd 

The compound of the present invention is administered and dosed 
in accordance with good medical practice, taking into account the clinical 
condition of the individual patient, the site and method of administration, 
scheduling of administration, patient age, sex, body weight and other factors 
known to medical practitioners. The pharmaceutic* "effective amount" for 
purposes herein is thus determined by such considerations as are known in the 
art. The amount must be effective to achieve improvement including but not 
iimited to improved survival rate or more rapid recovery, or improvement or 
elimination of symptoms and other indicators as are selected as appropriate 

measures by those skilled in the art. 

In the method of the present invention, the compound of the 
present invention can be administered in various ways, .t should be noted that it 
can be administered as the compound or as pharmaceutic* acceptable sa.t 
and can be administered alone or as an active ingredient in combination with 
pharmaceutical!, acceptable carriers, diluents, adjuvants and vehicles. The 
compounds can be administered intravesical* (directly into the bladder),orally, 
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su bcu,aneous,y or parent* ^ in-—- 

intraperitoneal*, intranasal administration as we,, as intrathecal and infusion 
technics. ,mp,an.s o, the compounds are aiso usefu,. The patient being 
.eated is a warm-blooded anima, and. in particular, mammals including man. 
The pharmaceutical* acceptable carriers, dliuents, adjuvants and vehicles as 
^ as implant carers general refer to inert, non-toxic solid or liquid fillers, 

ol ,uen,s or encapsulating materia, no. reacfing w«h the active ingredients o, the 

invention. 

« is noted that humans are treated generally longer than the mice 
or other experimental animals exemplified herein which treatment has a length 
propomona, to the length of the disease process and drug effectiveness. The 

single closes are preferred. 

The doses can be single doses or multiple doses over a period of • 
several days. The treatment genera,* has a ,eng.h proposal to the length of 
the disease process and drug effecfiveness and the patient species being 
treated. 

When administering the compound of the present invention 
parenteral!,, it is generally formulated in a un* dosage iniectable form (solution, 
suspension, emulsion). The pharmaceutic, formulations suKable for injection 
include sterile aqueous solutions or dispersions and sterile powders for 
.constitution into sterifc injectab te solutions or dispersions. The carrier can be a 
solvent or dispersing medium containing, for example, water, ethanol. polyo, (for 



37 



example, glycerol, Propv^na glycol. Ilguid polygene glycol, and *.«-). 
suitable mixtures thereof, and vegetable oils. 

Pr0 per fluidity can be maintained, for example, by the use of a 
coa ,i„g such as ,e*in. by the maintenance of the reared pa.de size in the 

for compound composes. AddKionally. various additives which enhance the 
stab*, sterility, and isotonic* of the compositions, including antimicrobial 
preservatives. an«ox,dants, chelating agents, and buffers, can be added. 
Pr even,ion ofthe action o, microorgantems can be ensured by vanous 
^bacteria, and an«unga, agents, for example, parabens, chiorobutanol. 
phenol, sorbicacid, and the ,IKe,n many cases, i, is desirable to include isoton.o 
ag en,s.,rexam P le. sugars, sodium chloridefand the IIKe. Prolonged absorb 

delaying absorption, for example, aluminum monostearate and ge,a„n. 

used have to be compatible with the compounds. 

Sterile injectable solutions can be prepared by incorporating the 

, ne appropriate sofcen, «h various of the other ingredients, as des.red. 
A pharmacological tonnuWon o, the present invent can be administered ,o 
pa , te nt in an iniectable formulae containing any compatible carrier, such as 
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va rious ve.Ce, adjuvants. addKves. and diluents; or the -pounds utiiized in 
the present invent oan be ministered parent to the patient in the ton. 
of siow-reiease subcutaneous implants or targeted deliver systems such as 
monoclonal antibodies, vectored delivery, iontophoretic, poiymer matnces, 
, ipo somes, and microspheres. Examples of deliver systems useful ,n the 
preS ent invention include: 5,225,32; 5,169,383; 5,67,616; 4,959,217; 
4 925,673; 4.437.603; 4,436 , 94; 4,447,233; 4.447.224; 4,439,196; and 

known to those skilled in the art. 

A pharmacological formulation of the compound utilized in the 
present invent can be administered orally to the patient. Conventional 
methods such as adminfctering the compounds in tablets, suspensions, 
solutions, emulsions, capsules, powders, syrups and the ,,Ke are usable. Known 
techniques which deliver it orally or intravenously or directly to the bladder 
(intravesical!,) and retain the bioiogical activity are preferred. In one 
embodiment, the compound o, the present invent can be administered «a„y 
by intravenous injection to bnng b,ood levels to a suitable ,evel. The patienfs 
, eve ,s are men maintained by an oral dosage form, although other forms of 
a dmin*tratio, dependent upon the patienfs condition and as indicated above. 
can be used. The ouantty to be administered vary for the patient being treated 

day and preferably are from 10 .g/kg to 10 mg/kg per day. 
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EXAMPLE 2 

POLYNUCLEOTIDES AND DIAGNOSTIC APPLICATIONS 
Utiiizing the methods set forth above, the polynucleotides set forth 
in Tables I and 2 were Identic and cloned as being differentially expressed In 
bladder cancer. 41 hybridizations were compared. 

The polynucleotides described in Table I are identified by clone 
number and accession number. This lis. includes sequences of Known genes 
wh ose funcfion in bladder cancer was heretofore unKnown and which were now 
fo und to unregulated in bladder cancer. Corresponding nucleic acid sequences 

are provided in Table 3. 

Tne polynucleotides described in Table 2 are identified by clone 

number. This list includes sequences of nove! genes which have no idenfity to 
Known proteins or genes in the gene database, Corresponding nucleic acd 

sequences are provided- in Table 4. 

,n both Tables I and 2. the differential expression pattern of the 
differe n, hybridization probes is provided. In both Table I and 2. the genes listed 
we re found to be upregulated in at least 60% of TCC samples and unchanged ,n 

at least 75% of the normal samples. 

Tables I and 2 show the genes as described in biological NCBI 
databases, wKh the Genebank number of each gene (where applicable) as 
presented in the NCBI database. The location of the Cone in the TCC 

the TCC chip. 
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The expression differentials described in Tables I and 2 were 
calculated as follows: Since a common control probe was used for all 
Hybridizations and the hybridfcations were earned ou, in three separate sets, the 
expression differentials in each respective set were calculated as compared to 
one of the normal bladder samples, as a reference probe. 

Thus, hybridization set 1 which includes hybridizations TC2-TC1 1 . 
a„ the results are shown as compared to the TC7 (norma,, hybridization result. In 

hybridiza.cn se, 2 which includes hybridizations TC16-TC25, all the results were 

calculated in comparison to the TC22 (normal) hybridization result. In set 3. 

which includes hybridizations TC28-TC41 . a„ the results were calculated 

compared to the reference normal probe from TC47. 



F""°"' KEY GENES 

,„ the present invention, the results of the 41 hybridizations were 
analyzed on the TCC microarray, in orderto provide a MM meaningful set 
o, genes (which each include one of the polynucleotides identified, that can 
,den«y TCC sampfcs and be used as a TCC marker se,. As a resuf, a sub-set 
of twenty-two (22) potential molecular markers for non-invasive TCC was 
identified and validated using supervised statistical analysis methods. The 22 
ge nes identify as potential markers (listed in Table 5) code for secreted factors, 
cytoskeleta, and membranal proteins. a„ potentially suitable for the development 
of non-invasive diagnostic tests. This marker se, o, genes is described below. ,n 
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Exa mp,e 4, Section 5. en«led "Expression patterns, scores and significance 
valU es ,or 22 short-listed genes", and related Tables C1 and C2. 
Thirteen (13, o, these 22 polynucleotides are described in Example 2 ( see 
Tables 1-4). and nine (9) are newly described in this Example (see Tables 5-6). 
,n Table 5, the polynucleotides already described in Tables 1 and 2 are 

designated with an "x". 

The 22 gene marker set was identified following reanalysis of the 

41 hybridizations. A„ the experiments were constructed so that such an analysis 
can be performed. The hybridize scheme (describe in Section 1 hereunder, 
wa s based on both indivtfua, sample hybridizations and a common centre, 
appr oach. All the hybridizations passed oua.ity control examination and pre- 
processing steps (described in Sections 2 and 3 hereunder, which are critical to 
e^ish input mateha, suHable for any statistical analysis. Mowing these pre- 
passing steps, the hybridation data was scored according to its "similarity- to- 
. .he desired domination - non-invasive TCC versus norma, urothelium (see 
Example 5). 

Two independent (though related) standard scoring methods were 
used and individual genes were seiected that discriminate between noninvasive 
TCC and norma, urothelium (see Example 5. Section 4). 

Full bioinformatic annotation analysis of the 22 listed genes (see 
Example 5, Section 6, and detaHed description of their potential biologica, 
rel evance to cancer in general and to TCC in particular is presented here.n (see 
Example 5, Sections 6 and 7). 



42 



Based on .he sequence annotation. 17 of the select 22 markers 
were found to be Known, characterized genes and 5 code for genes with an 

with ye. unknown protein products code for membranal or secreted pro.e,ns. 
b ased on the Applicant propneta* "secreted" probe (described in co-assigned 
USSN 09/534, corresponding to PCT patent publican number WO 00/56935 
which ,s incorporated herein by reference in its entirety, and on domain ana,ys,s. 
- ( see Exampie 5, Sections 4 and 6). Being secreted, some of these proteins can 
be identified in body fluids (in particular urine), thus abating the need for 
invasive tests. Al, other non-secreted proteins can a,so be detected in urine, 
which always contains shedded urothelial cells. 

For a diagnostic assay, urine samples of TCC patients and of non- 
TCC patients should be analyzed. Urine can be collected, preserved in -70-C 
and used e*ner for protein assays (Western analysis) with the relevant anybody 
and,or,or ELISA tests with the same re,evan, antibody. Similarly, blood samples 
from the same donors can be collected, and the separated serum samples can 
be used for detection of me candidate proteins in me serum using similar protein 
analysis approach. After establishing .he particular assay for a single protein, 
assays for a combination of 2 or more different proteins can be se, to increase 
, he va,* of the obtained results for each sample and to obtain robustness. 

According to biomedical literature, the 17 known genes were 
d assrf,ed into three fun*na, groups: tumorigenesis, keratinocyte differentiation, 
and cel, motifrfy and prolKeratio, Thus, these markers can also ,u« a fun*na, 
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ro ,e in TCC. Being function* reievan, these genes can be used as poss.bie 
.argets ,cr genes therapy for TCC. This can be achieved by antagonizing the,r 
affects in the tumor using antisense deiivea; approach (for a„ such proteins,, 
b,ocking their enzymatic activity „or enzymes,, or specific drug de,ivery, as 
r e,evant. Since dtferent keratins were detected as being differentia,, expressed 
in TCC, spectfc typing of the different keratins in urine o, TCC patients, using a 
sin g,e mu„i-gene assay, can fa-ate and improve robust TCC diagnostics. 

The specificity of markers for TCC over other cancers ,s an 
import an, consideration, in particuiar, the expression ,eve, of the seiected gene 
set is anaiyzed in Cher urogenKa, cancers, such as rena, carcinoma and 
prostate cancer, important,, sampies obtained from c,inica,,y re,evant con**. 
such as inflammation or benign prostate hyperpiasia (BPH) must be induded. 
prospective studies of patients is a,so be carried out. as we,, as comparison of 
sampies obtained during foiiow-up procedures (to monitor tumor progression ,n 
TCC patients). 

Ail the genes descnbed in the present invention are tested for the,r 
, eve , o, expression of exfoiiated ce„s in urine, according to the lowing protoco, 
Urine sampies (e.g. 100m, o, urine, are coiiected from 3 different popuUrtons: 
h ea,thy donors, TCC patients and a re,evan, centre, group (e.g., prostate cancer. 
b ,adder inflammatton). The ex.Cia.ed ce„s are separated from urine (,. ,s 
possibie to Keep the separafed ce„s in -70-C pending further work, and used for 
preparation of RNA. Such an approach enabies tracking cancer-reiated changes 
a, the ,eve, of gene expression. Foiiowing RNA extraction, RT-PCR is performed 
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for sefccted genes. Primers specific for each of these genes ere coyote, and 
used for the amplification of .he cDNA products, being constructed so tha, each 
o, the tested genes are be amplified to a .ragmen, o, a drfferen. size. RT-PCR 
.action is carried ou, in semiquantitative approach. Fractionation .or the 
resu ,tan, product, on geis indicates the reiative abundance of each of the tested 
genes in the tested RNA sampie. Mernativeiy, TaoMan (Applied Biosystems) 
enabies a fuiiy guan.Ka.ive time-course demonstration of the leve, of expression 
of these genes. ResuKs for each o. the tested genes are defined and 
documented and used for statisticai analysis. Finally, a value is caicuiated for the 
expression of the predictor gene set in TCC and in non-TCC samples. 
Comparison ofthe expression level resuKs for a grven unknown sample to th,s 
Known ca,cu,a.ed value predicts „ the tested sample contains TCC (under certain 
confidence leve, .p value,. All information from the tested samples is gathered 
dU ring the establishment o, the described diagnosfic protocol and the statistical 
analysis is expanded so that al, samples participating in the study are included. 

According to the present invention, gene sequences are included 
^ich are uniformly expressed in norma, and TCC tissues (see Example 5, 
Section 8). These genes can be used as an Interna, contro, in each multiplex RT- 
PCR. To this end primers for amplification o, such genes are constructed and 
applied within the RT-PCR reaction of the marker gene set. 

,„ further analysis, the results obtained from a single donor are 
compared between different tissues obtained from the same donor, e.g., 
ma ,ched urine exfoliated ce„s and tumor tissue (for RT-PCR approach) or unne 
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and blood fo, the protein analysis approach. This enables a deeper 
understanding of the molecular changes associated with TCC and their genera, 

presentation in different organs. 

The genes provided in the present patent application can also be 
used for printing a small diagnostic TCC mini-microarray. This chip includes a,so 
Cones w«h a unKorm high expression in both norma, urothelium and TCC (see 
sample 5. Section 8). Such TCC mini-microarray can be used for bom disease 
detection and validation and for mo,ecu,ar staging and grading of the TCC 
tumors. Samples for hybridization on such chip inCude materia, derived from 
TCC tumors and from norma, urothelium from different donors. 

,n addition, urine exfoliated cells from the same donors can be 
used for RNA extraction and RNA amplification. The RNA can be used for 
generating cDNA probes for the TCC mini cDNA microarray. This enables 
characterization of gene expression patterns of a„ the printed genes and 
comparison between the expression pattern obtained using tissue-type matenal 

fr0 m non-TCC donors such as patents with BPH and inflammation, these also 
comprise part of the hybridization probes. 

The present invention further presents the use of a smal, subset of 
genes (2 or more genes) together for providing an accurate diagnostic tes, for 
TCC With one exception (the cytoKeratin 8 and « assay), al, commercial 
exi s«ng molecular diagnostics for TCC are based on tests for single proteins. 
These can be insufficient to account for the inherent complexity of cancer, as 
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well as for the ^ o, both heaimy an. affected popular, To this end. 
t he pr esen, invention describes the use o, a con.ina.ion o, several 3 enes and,or 

patients for multiple-gene RT-PCR diagnostic testing. 

lr , slt u hybridization analysis using the same gene set can be 
performed as an auxiliary gua,«ative validation step, using paraffin blocks from 

normal urothelium and TCC tumors. 

The genes of the present invention also characterize different 
stages o, TCC. Correct "staging" o, TCC is fundamental for the management o, 
t his disease. Upon detect of a new TCC patient, the developmental stage of 
th e tumors determines relevanttreatment. For example, if a non-invasive tumor 

is identtfed, "TURV is the surgical approach recommended. ... however, the 

tu mor is-defined as invasive TCC, cystectomy is usually the treatment of chcce. 

.dentin of those noninvasive TCC patients that might progress is of.great 

clinical value. 

Keratin 13 is identified herein as a marker tha, can differentiate Ta 
fro m T1 and invasive tumors (see Section 10). The analysis described in .he 
pre sen. invent indica.es a clear discrimina.,on between Ta and T1 tumors, 
where .his gene is upregu,a.ed in Ta .umors and downreguiated in T1 tumors 
wh en compared .0 norma, urotheiium. This gene in include part of .he diagnose 
tests described. 
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sorting .0 the present invention, 22 polynucleotides included ,n 
22 ge nes were identKed; <hese g enes serve as potentia, marKers for TCC. 
especi* tor no,invasive TCC. These g enes, and a„ the genes included ,n 
Tables 1-6. can be used for diagnosis of TCC. 

length genes or gene fragments are suggested as marKers for 
invasive assays. PCR products, antisense products, protein products and 

. j aMjnst tnese genes can be applied both for diagnoses for 
antibodies raised against tnese go. 

TCC and fortargeted gene therapy. The tests for the levels of these genes 
and ,or proteins can be performed in body «uids, in the original tumor or ,n other 

EX^PLE 4 : HYBR.D.ZATIONS AND STAT.ST.CAL ANALYSIS 
Section 1. Hybridization scheme 

The hybridization scheme according to the present invention is 

based on three principles - • 

, individual hybridization of each sample (normal or TCC) 

^enever possible: This provides a comp— overview of the en«re 
samp ,e set, with minima, a-pHori assumptions, and with maxima, measurement 

cr U cia,forsuccessf U ,ana,ysis of the resu,ts,nasma„ number of cases, dueto 
.sufficient amounts of norma, uro,he,ium materia,, poo,s of severa, norma, 
samples were used as a single probe (See Table A,3« set). 
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2 Utilization of an identical common normalizing probe 
(-Common Control" or "CC") in each se, o, hybridation*: By maintaining one o, 
the probes as a constant, common probe, hybridation results can be compared 
across experiments. The common normalizing probe used in the present 

samples. This materia, should be similar in composition to the one used .or 
construction o, the TCC microarray. Thus, it has a high probability to hybrid^ 

appropriate normalization of signate between hybridizations. 

3. Secreted and membranal proteins have an obvious 
advantage as molecular markers. The "secreted- probe allows sequence- 
independent identic of genes potentially coding for secreted and 
m embrana, proteins. ,f such genes are highly expressed in tumors i, is plausible 
,o try to find their protein products highly expressed in urine, too. 

Hybridizations of TCC and of normal urothelium samples were 
performed in 3 sets which were separated in time as we,, as in the methods of 
RNA preparation (po, y A and tota,). as shown in Table A. AKhough these 
dW erences Increase the variably o, the results, they a,so suggest fta, the 
identmed phenomena are robust to experimental intricacies. Comparison of gene 
expression results between the sets increases the valid* of the results obtained. 
Drfferences in RNA preparation can also affect the common normaiizing probe. 
For examp,e, in the flrst two sets an identical tota, RNA poo, was used (Table A 
common normal probet), whiie in the 3- set polyA RNA was extracted from 
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the same pooi ot iu«> 



normalizing probe2). 

Tabic A: Derailed hyhridtaanon scheme 




Membrane bound 
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, n order to identify secreted and membrane, proteins, two 

SW780. Briefly, membrane-bound poiyson.es were separated from free 
polysomes usin g sucrose step 3 rad,ent. RNA coding for potential secreted 

f or in,racei,u,ar proteins front the free poiysoma, peiiet. Each RNA ("Secreted" 
and -intraceiiuiaf) was ,abe,,ed with a different dye and hybridized to the TCC 
array (Tabie A. set 4). Signtfcan, differentia, expression in one of the probes . 
a „ indication of potential celiuiar compartments (intraceiluiar or 
secreted/membrana,,. As a convention, a negative differentia, represents 

secreted proteins. 

section 2. Qua,Ky contro, (QO, prehminary eva.uation 

ln order to ensure the o.ua,ity of me results shows in the present 
invention and to minimize expehmenta, artefacts, a,, hybridization resuits 
underwent severa, standard OC steps. Since the hybridizations were performed 
in three separate sets, QC procedures were done «Nn sets, consistent w*h 
inventors' past experience. 

expression ,eve,s are compared across hybridizations due to the use o, a 
common nom,a,izing probe. However, this can be farthfuiiy performed on,y * the 

oommon contro, probe behaves consistent across each set of hybridization. 

This constancy is *rs. measured by the pair-wise corrections between the 
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for this data set. 
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2 SianaUuafc A second measure of hybridation quality is 
t he number of elements which yieided a significant signa, and a re,iab,e signa, to 
ground ( S2B) rafio w«h each probe. Since a custom cDNA array was used. 
b0 ,h experiment and centre, probes are expected to yield a similar number or 
sig „ ffl cant signals (A se, or n hybridizations of an m-gene array is typically 
trs a,ed as a matrix A of size mXn. Thus the expression ieve, of each gene in ail n 
nations is a vector o, size m ("gene vector-)). A singie hybridization 
experiment is represented by a vector of m expression measurements 
.-hybridization vectoO- m differentia, profiling the vector can represent a s,ng,e 
prob e (in which case . is a vector of signa,s) or both (a vector o, dWerentials)). 
Thus when comparing the hybridization o.ua,«y of the common con.ro, probes, 
pair-wise correction was calculated between the common contro, sfcna, vector ,n 
one hybridation to that o, another. Missing values were deieted on a case-wise 
basis Traditlona, threshoid (200 units, for signais. for both common contro, and 
.ested samp,e probes, and.S2B (va,ue of 2.5 for at ,eas, 40% coverage of 

element) were used (Table B2). 

The first and third sets of hybridizations yielded signals of high 
^aiity in both common contro, and experiment probes. The q ua,«y o, the second 
set was significantly lower (Table B2). 
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Table B2: Signal qua 
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Bmsssmi ^Bm^mmm. Hierarchical clustering 
of me hybridizations in each set provides an additional. a,bei. prelimina^. 
estimate o, gua,*y. Either the Pearson correiation coefficient or a standard 
Euclidean distance was used as the distance measure between different,. 
hyb ridization vectors. Hybridizes were clustered according to these distances 
by average linKage hierarchies clustering. Missing vaiues were deieted on a 
case-wise basis, dusters o, hybridizations cn be iden.if.ed and evaluated ,n 

0 light of existing knowledge. 

1 Many of the unexpected phenomena can indicate the limitation of 
1 previous understanding, and serve as a starting point for Cass definHion. 

S However, "outlying* hybridizations can aiso indicate q ua,Ky prob,ems. Overall m 

» each se, (Table B3, most of the separation between hybridizations is consistent 

1 with the expected TCC and nonmal urothelium separation. Even in hybridizers 

1 of lower guality. such as those of the second se, a clear separa«on between 

TCC and normal samples is observed. 

One of me TCC samples in the first set (TC6) is such an "outlyer 
(Table B3>. as well as one o, the normal samples in the second set. and another 
norma, sample (TC35) in the third. (The "outlyers" do no, appear to be 
m isc,assmcations). For example. TC35 (a norma, sample which is an "ouflyer in 
the third set) does not behave liKe a TCC sample. Rather those genes tha, are 
up-regulated in TCC samples are down-regulated in TC35). 
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Table B3: Relationrhip. hetw.en hybridizations: Hierarchical 
clustering of hybridizations (by sets) 
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Table B4 Hierarchy 



TCC All 
Raw data 
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None of the "outlyers" was eliminated from subsequent analysis 
steps. Rather, they were included to facilrtate the selection of a more robust 
marKer set (Tab.e B3). Here, more complex relations are observed between 
g,obal expression profiles. First, the two invasive hybridizations (TC34 and 
TC45) are distinct from other TCC samples (Tables B3 and B4>. Second, the 
relationship between global Ta and Tt profiles is no. straightforward. Most of the 
Ta samples form a unique cluster in the 3rd set while the T1 samples are more 
dispersed. 

Section 3. Pre-processing of the hybridization data. 

All hybridization data, even of good global quality, was filtered and 
processed priorto addfcona, large scale analysis. This included global balancing 
of signals, identification and treatment of problematic signals, normalization of 

the hybridization data and filtering. 

1. SigjialBalar^m. Differences in labeling and hybridization can 
bias the signals obtained with a Cy3 probe relative to the Cy5 probes. For each 
hybridization, linear balancing is used to overcome this bias. The balancing 
coeffient is calculated as (sum P1)/(Sum P2). 

2. Dramatic signals. Two types of problematic signals are 
identified: very low signals and exceptionally variable signals. The first are 
signals below a pre-set threshold. The second are common control signals which 
significantly (>2 SDs) deviate from the average common control signal for a given 

59 



element. All problematic common con.ro. signals were replaced with the average 
signals for the common control in the given element. 

3. NonraEzation. In order to obtain meaningful differential 
expression values (TCC vs. normal) and to reduce deferences between sets 
(in ,er-block variance), a second step of normalization is performed. In this step, 
each of the balanced drfferential expression levels (relative to the common 
control) is normalized by the average differentia, expression of the given e,emen« 
in the norma, samples of the same set. The resulting normalized drfferential 
values give a measure of the deference between the expression o. me element ,n 
a given sample (norma, or TCC) and the average expression ,eve,s in norma, 
„ s5 ues. Note that due to the use of averages in replacement of problematic 
signals, the variability in the norma, samples is reduced by this procedure. 

4. Filtering. ,n order to reduce the data set and limit it to higher- 
quality elements there was restricted from further analysis any overall weak 
elements (no signal above 200) and any non-drfferentia, elements (for which no 
normalized differentia, va,ues exceeds 11.7,)- The remaining number of elements 
following these f„,ers is 6693. Low-<,ua,ity e,ements (where more than 20% of 
signals are problematic) were filtered only in later stages. 

Section 4. Class Prediction: Normal urotheliurn vs. non- 
Invasive TCC and selection of marker set of genes 
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To discriminate between normal urothelium and non-invasive TCC, 
each of .he gene with hybridization value is scored according to Ks -similarity- to 
the desired discrimination- N vs Non invasive TCC. Two independent (though 
re ,a,ed, standard scoring methods from the three described below were used. 

cw.ttr.1 Methor" "redlction 

Scoring methods 

a . e,..,wc nnnaired t-test. The t-test is a statistic for 
measuring the significance of a difference of the means between two 
distributions (mt , and m2) considering the variance (s21 and s22, within each- 
gr0 up The two popular are expected to be drawn from a norma, distribufion. 
■„ the case, mesa are the mean expression levels o, a gene in norma, urothelium 
and in TCC tumors, which are supposed to have a ,og-norma, distribution (thus, 
,og values are used,. Statistical significance estimates (p-va,ues, available for the 
, statistic. Since a ,arge number of measurements is avai,ab,e for a sma„ number 
of samples, a much more stringent threshoid of significance is used (p<106). 
which, according to the Benferroni adjustment corresponds to p<0.001. 

a . ,-trrr-"" error. This method scores genes 

according to probabiKy of error or misdassification. As par. of this procedure a 
discrimination *resho,d is determined. The threshold T(gj) is <aKen such .ha. the 
two types of misdassification error become equal, as: 

(D.1) T(g,) = trnKafsatei) + m2(g l rs1(g i )l / [s1(g i )«2(g l )l 
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and the significance of the reclassification error is given by 
(D.4) P = 1- F[(T(gi)-m2(a))/s2(gi)] 
where F is a distribution function N(0,1>- 
b . ^...mn^tina Prnnrfp rWcs (ROC) curves. ROC 
curves are used to evaluate the power of a Cassation method for dWeren. 
asymmetric weights of false negative vs. faise positive errors (or sensitivity vs. 
specificity), m diagnostic aprons faise negate errors can be detrimental 
while false posKives can be tolerated. A ROC cuwe plots the tradeoff between 
the ft* types of errors as the classification threshold varies. For each potential 
meshed, the rate o, true poshes is plotted against the rate of false positives. 
Accuracy (A) is indexed by the area under the curve. A straight line (i.e. 50.50 
chance =, correct diagnosis, no better than chance,, has A=0.5. Perfect accuracy 
(A-1) means that for a given threshold all predictions are correct. 

The first score used is the "student's unpaired t-tesf, as above- 
described,^., one-way ANOVA with two classes, which reflects the difference 
between the classes relative to the variance within classes. The distribution of 
this statistic is reso.ved and significance ievels o, each score (its p-va,ue) can be 
derived. The second method used scored genes according to an "estimation of 
prediction error, as described above, which again provides significance 
estimates (p-values) in a straightforward way. 
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Both scoring methods yielded similar numbers of elements with 
statistically signify, scores: 77 elements according to the t-,est scores <p<10*), 
and 63 elements with low .reclassification errors, 
i This list was further narrowed according to several additional 

considerations: 

nrrli.-i--,- - ■■p.^-tated genes. Non-invasive tests, such 

0 as a urine test, require the identification of tumor cells or proteins on a 

1 considerable background of norma, tissue. Since the inventors assume tha, only 
1 significant* up-regulated TCC genes have a chance to be detected on such 

1 backgro und, while genes down-regulated in TCC are no. be farthfully detected in 

™ non-invasive tests, they specifically selected such genes according to the,r 

» norma,ized dtfferentia,, Furthermore, genes having a particularly low expression 

normal tissues were prioritized, to minimize detection P rob,ems in further 



in 

assays. 



. 2 . r-nn«i*tent scores. Elements with high scores in both 
m ethods were phoned in the final list The error-based method was given 
preference over the ,-test scores due to the prediction thershold it provides. 

3. Bsdiffldancy, Approximately half of the clones on the array 
were derived by a subtraction procedure (SSH) enriching for TCC up-regulated 
Cones, inevitably, this significantly increases array redundancy, especially for u P - 
regulated genes. In order to address this problem, a large portion of the 
ignificantly scored clones and up-regu,ated genes has been sequenced (-900 
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Cones,. Only a sin g ,e representative C each redundancy group was retained ,n 
the list. 

« Element^. A stricter threshold of element quality was 
added, and only elements with less than 8 problematic signals were included (in 
rnos, genes a much smaller number of probiematic signals was encountered). 

5 . senejdentjtt The functional role fulfilled by different genes 
as well as previous Knowledge can change their prion*. For example, one low- 
scoring gene (FABP) was selected due to *s involvement in psoriasis and 
squamous cell carcinoma of the bladder. 



64 



10 



20 



30 



35 



Section 5. Expression patterns, scores and significance vaiues 

for the 22 short-listed genes of the invention 

The final subset of informative genes comprises the top 22 up- 
dated genes after station of consistency, redundancy, and quality filters. 
These genes obtained high scores as discriminators by both scoring methods. 

The differential expression patterns, statistical scores and 
significance values for the 22 selected genes are shown in Tabie 
expression levels are shown in the Table in the following order: 
Normal urothelium (16 first hybridizations); 
T1 samples (13 hybridizations); 
Ta samples (10 hybridizations). 

The levels of differential expression and the signal values of the 2 
■Secreted- probes are also shown (Table C1 , four columns headed 
••Secreted.. '). Strong negate dtfferentials indicate a gene potentially encoding a 

secreted or membranal protein. 

The statistical scores are given in the following order: 
1 . Estimation of mis-classification error is given under column 
-Em* T 2. P value - Fisher criteria (similar to T-test) is shown as P-values, 
(column •■Pva.ueFisheO. and 3. ROC value in the column headed with the same 



name. 
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Table C2 includes the raw measured signals for each gene 

control signals. P2 signals represent the 

interesting. The genes are sorted 



hybridizations. P1 are the common 
measured tissue samples, and are thus more 
by statistical significance with the top gene having the highest score. This is also 
the order in which they have been incorporated into the predictor (Table D1). 



Table D1 : Unsupervised analysis of tumor samples 
Hierarchical clustering of tumor samples (3 rd set) 



TC32A_TA 
TC33A_TA 
TC41A_TA 
TC42A_TA 
TC44A_TA 
T C31A TA 
TC43A_T1 
TC40A_T1 
TC30A_T1 
TC29A_T1 
TC28A_T1 
TC45AJN 
TC34AJN 



Classification of TCC tumors 

5093 genes 
Euclidean distances 
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Section 6. Sequence annotation and bioinformatic analysis of 

the genes with unknown protein product 

As shown in Table A, set 4, based on sequence annotation, 17 of 
the selected 22 markers are known, characterized genes; 3 additional genes 
code for hypothetical proteins. (One of these hypothetical proteins was identified 
through an EST contig of the original clone). The remaining 2 clones code for 
novel sequences. For one of them an EST contig was assembled, but 
homologous genes were not found. Only limited information is available for the 5 
novel or uncharacterized genes. According to the "Secreted" probe, as described 
herein, one of them, the CG.-81 hypothetical protein, is a potentially secreted or 
membranal protein. Domain analysis also indicated the presence of a 
transmembrane domain in this protein. The other four genes cannot be classified 
with confidence, although the unknown gene in clone 70E8, can be marginally 
classified as "potentially secreted". No additional significant domains were 
identified. 



30 Sectlon 7 . Bioinformatic analysis of known genes 

According to the biomedical literature, the 17 known genes were 

classified into three functional groups: tumorigenesis, keratinocyte differentiation, 

and cell motility and proliferation. Thus, these markers fulfil a functional role in 

TCC. It was noted that in a previous analysis, performed with sample pools on a 

40 general-purpose human microarray, similar functional groups (and in some cases 
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coined paten, application USSN 09,534,661 corresponding to POT patent 

publication number WO 00/56935 ) 

0„,y a few of these 17 markers were previously considered reiated 
t0 or seated inbladder cancer. These are the keratin family o, protein, some 
of which are Known TCC marKers. midkine, for which a single report ( PMiD: 
S65368B, implies its connection to invasiveness in TCC; and PABP-5 which is 
relate d to squamous cel, carcinoma of the bladder. A number of the other 

specificity of markers for TCC over other cancers is an important consideration. 

Four of the known up-regulated genes which are related to 
.umorigenesisareeHhermembranal or secreted. Threeof them were no, 

piously reported in TCC, and the fourth (midkine) has been related to 
invasiveness. Clear,,, secreted and membrane, proteins have a unique 
advantage for the development of a diagnostic test. 

intriguing amongst the functional group are makers related to 
keratinocyte differentiation. The keratins, which are cytoskeleta, proteins, are 
known markers for keratinocyte differential. Five dtferent keratins were 

. . *, 0 „na cenes Two of them are known to be TCC markers, 
detected in the top-scoring genes. 

The remaining *ree (KRT 7. 8. 17, were included in the 22 marker set 
severe, research groups using different experiment* approaches. Non- 
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con s iSt en tf ,nd i n g sas,c th e i r reg u 1 a«on,nTCCwe rer epo rt e.T ta s,pe* 

assay, can faci,i.ate and improve robust TOO diagnostics. 

A second major group o, markers associated with Keratinocyte 
oW erentiation are the StOO proteins. These are iow-moiecuiar-weigh, caicium- 
ending proteins which are probabiy invoived in the relation ot a number o, 
ceiiuiar processes inciuding ceii cyoie progression and ceii d«erent,ation. Pour 
d«erent S1C0 proteins (8100A11. A6. A13.P) are inciuded in the marker set. 
Hone has been previous, associated w«h TCC. S100 proteins were impiicated 
in oth er cancers (AML, coiorectai). The StOOP was tound to be down-reguiated 
upon androgen deletion in the androgen- dependent prostate cancer ceiMine 
LnC ap Another S100 protein, psoriasin (S 10 0A7, no, inciuded in the set,, ,s 
invo „ed in both invasive breast cancer, scuamous cei, carcinoma o, the biadder 
and psoriasis, another disease invoiving keratinocyte, Note, that another . 
psoriasis-reiated gene, PA-FABP is aiso inciuded in *is proposed marKer set. 
PA.PABP has aiso been impiicated in squamous cei, carcinoma o, the biadder. 
interacts between another FAPB (E-FABP) and psoriasin are we„- 
aocumented in psoriatic Keratinocytes. The identtfed S1 00 proteins and PA- 
FABP are important novel markers for TCC. 
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Section 8. Sequence annotation of genes with identical 

expression pattern in all tested samples 

The diagnostic assay is based on the genes which are up-regulated 
h TCC. in a., suggested tests, and mainly in the RT-PCR based assay, interna, 
contro, for each tested samp.e are benef.cia.. Such contro.s can be genes wh.ch 
are normally not upregu.ated in TCC. To this end the hybridization data was 

analyzed to identify genes which are: 

! . Expressed at a high, easily detectable level in all the control 
and TCC samples. This is important to enab.e detection even in small amounts of 
RNA. 

2 Genes which are not differential in TCC compared to normal 
urotheliur. Such genes were specifically selected according to their normalized 
differentials 14 genes were detected as suitable according to these critena 
(Section* hereunder!. These genes are included in the diagnostic assays, and 
used as interna, references for a normally, uniformly expressed gene in TCC and 



in non-TCC cases. 



Section 9 



: Expression patterns, signals and annotation report 



for control genes 

35 The differential expression patterns and basic annotation for the 14 

non-differentia, genes suggested as interna, contro.s is provided in Tab.e E (at 
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end of spectfcation , ]us< before Table 1). Tabie E displays dWerentia, 
expression resuits. and shows signals of al, genes in all hybridizations. 

The expression leveis are shown in the fallowing order: 
Normal urothelium (16 first hybridizations) 
T1 samples (13 hybridizations) 
Ta samples (10 hybridizations). 
A ,| 14 clones were fully sequenced from both sides. Full 
(contigized, sequences passed via the standard sequence annotation platform 
including sequence QC. chimer detection, and homology searches w,th,n 
Oenban.s non-redundant genomic and non-genomic nucleotide databases, the 
non-redundant protein database and the EST database. EST contigs were 

annotated. 

supported the choice of any one, or a combination of 



The results 
, of these 14 genes as internal controls. 



two or more 



Section 10. Class definition and characterization 

Staging and grading of TCC is not straightforward. In fact, 
subject decisions must often be made in order to classify tumors, and 
pathological experts can dtffer on the correct diagnosis. Unsupervised analysis 
me ,nods as well as supervised methods o, Cass prediction were used, in order ,o 

reduce the dependence on expert opinion. 

71 



During the quality control process (section 2). some separation 
hetween T1 and Ta samples was noted in me third se, of experiments, as well as 
in gioba, Custehng of a„ 41 profiles (39 norma, or non-invasive, plus 2 invasive 
TCC samples). Therefore Custehng of the tumor samples **n ft. ft« - 
only (Table 01) was pursued. This se. contains the largest and most variable 
collection of TCC samples. A clear differentiation exists between Ta and T1 
tumors. One tumor Cassffled as Tt^Ta resides within the Ta Custer, bu, ,s 
separated from o*er Ta tumors. The two invasive tumor samples are in the T1 
Custer (one clearr, inside, the other outing,. Thus a, the level o, hierarchical 
clustering Ta tumors are separated from T1 high-grade tumors. 

Standard scoring method (example 5, section 4) was employed for 
! cass predion in order to iden«y specific molecular markers that undertie the 

i Ta/T1 separation. 

I The Keratin 1 3 gene was found to be the highest scoring gene. H is 

down-regulated in T1 samples and up-regulated in Ta (relative to normal 
samp,es). Keratin 13 is Known to be expressed in urothelium. Its expression ,n 
urothelial tumors depends on their degree o, dWerentiation. It is expressed on,y 
in well-dWerentiated tumors and absent „om poorly drrferentiated ones (PMK* 
,70*547). Since most of the Ta tumors in this se. were classed as low grade 
„hi,e the T1 .umors were most* high grade, over-expression o, the KRT1 3 gene 
«„ be attributed to the degree o, the differentiation as we,, as to the staging of 

40 the tumors. ^ 
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The expression pattern of KRT1 3 in all TCC hybridizations was 
th en studied. The results are not straightforward. Approximately half o, the 
tumors show up-regu,ation of KRT13, while down-regulation is observed in 
others, with no o,ear correlation to eKher stage or grade of the tumor. KRTt 3 ,s 
highly relevant to the sub-classification of TCC tumors. However, the exact 
correlation to the classical clinical classifications remains to be elucidated in a 
,arger set of tumors involving different stages and grades and infollow-up 
studies. 

EXAMPLE 5: BIOINFORMATICS ANALYSIS OF 22 

SHORTLISTED GENES 

1 . Sequence - All clones were fully sequenced from both 
sides. Sequence passed the standard sequence criteria, except where otherwise 

noted (see Table 6) 

2 . Annotation - Full (contigized) sequences were passed 

through the standard sequence annotation platform including sequence QC, 
chimer detection, and homoiogy searches to GenbanK's non-redundant genom,c 
and non-genomic nuclide databases, the non-redundant protein database and 
the EST database. EST contigs were assembled for several novel genes for 
which ESTs were available, and further annotated. Complete annotation and 
sequence information is available in Table 5. 

3. Literature - an extensive search of the literature was 
performed for each of the Known genes. Detailed information is given below. 
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Known genes are Cashed according to their function in tumorigenesis, 
Keratinocyte delation, and cei, mo S ,i«y and probation. References are 

given as PMIDs. 

3.1 Genes associated with tumorigensis 

rnnr . tn, .ecretrrl tnd membrane! proteins 

3.1.1. Syndecan 1 (Accession: g il4506858|reflNM_002997.1|) 

An integral membrane protein, 310 amino acid-long, with a signal 
peptide at » NH,-terminus. Contains a matrix-interacting ectodomain with 
putative glycosaminoglycan attachment sites, a hydrophobic membrane- 
spanning domain and a cytoplasmic domain. Is connected to cel, aggregation in 
ma „gnan, mesotheliomas wKh epKhella, and/or sarcomatous morphology and ,s 
r eo. ulr ed .or wn.-1-induced mammary tumorlgenesis in mice. On the other hand, 
its expression is inversely correlated to the aggressiveness of basal cell 
carcino™. PMID: 2324102, 10912783, 10888884. 10770430. 

3.1.2. Hepatocyte grow* factor activator inhibitor type 2 (HAI- 
2) (Accession, gi|2924619|dbi|AB006S34.1|AB006S34) 

HAl-2 is a KuniU-type serine protease inhibrtor which was recently 
identified as a potent inhibitor of hepatocyte growth factor ac«va.or. I. was also 
independent* reported as placental bikunin (PB) and. as a protein over- 
expressed in pancreas cancer. However, its expression was conserved in the 
neoplastic colored mucosa, and no relationship was found between HAW/PB 
mRN A levels and colorectal tumor stages. HAM is produced in a membrane- 
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associated form and secreted in a protect truncated form. PMID: 

10695988 , 10762618 

3.1.3. Midkine (neurite grow* promoting factor) (Accession: 

gi|45051 34|ref|NM_002391 .1 1) 

Midkine is a heparin-binding growth factor, implicated in various 
logical phenomena such as neurona, survival and differentiation, tissue 
remodeiing and carcinogenesis, in the G401 cei, iine, midkine «a,es a cascade 
of intraceiiuiar protein tyrosine phosphorylation mediated by the JAK/STAT 
pathway after binding to its high affinity P 200( + )/MKR cel, surface receptor. The 
most intriguing feature o, midline in cancer is * augmented expression in 
advanced tumors at a very high frequency in a non-tissue specific manner, in 
addition, its high expression is aiso detected in precancerous lesions. Midkine 
exerts carcinogenesis-re.ated activities, inciuding transforming, ant,-apooto„c, 
angiogenic and flbrinolyUc ones. These data provide a possibiiity of clinical 

th erapy using Ks promoter region and therapeutic strategy choosing midKine as a 
moiecuiar target were aiso suggested. MK was suggested as a marKer for eariy 
and iatent biadder cancer disease (spec** of 0.86). Recent pub,ica«on 
demonstrated good correiation of MK over-expression win poor outcome ,n 
patients with invasive cancers. PMID: 10879061, 8714367, 10902971, 
10626184, 10545795, 10408712 
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• 2 SLC2A1 (GLUT1) (Accession: 

3.1 .4. Solute carrier family Z ^ 

gi|5730050|reflNM_00651 6.1 1) 

increased expression of glucose transport (GLUT1) has been 
.ported in many human cancers. Suppression o, GLUT1 mRNA has been 
sh o«n to suppress tumor growth. Some studies have reported associations. 
bet ween U expression and probative indices, whUs, others suggest tha, 
GLUT1 can be o, prognostic signincance. especialiy in iung cancer. No 
1 connection between GLUT1 up-reguiation and TCC has yet been reported. 

| PMID: 10983690, 10806305, 10795374. 

| rrr~ for nr0tei " S 

m 3.1.5. Cystatin B (Accession: 

f* gi|726301 1 |gb|AF208234.1 |AF208234) 

' Cystatins are endogenous inhibitors of lysosomai cyste.ne 

1 proteinases, the cathepsins (Cats, imbaiance between cathepsins and cystatins, 

and metastasis. Cystatins were found to be U p-re 9 u,a,ed in reiation to tarnation 

30 

ovahan, uterine, meianoma and coiorectai carcinoma). ,n contrast, reduced 
expression o, cyst* B was found in esophageai-carcinoma tissue and was 
assorted with *mph-node metastasis. The appiication o, cystaflns for 
prognosis, diagnosis, foiiow-up and anticancer therapy has been proposed (but 
oaaOCB. the preiiminary experiments in TCC, using genera, microarray 

76 



45 



oon.ain.ng 10.000 human ESTs, Cystatin A was found to be up-regulated in TCC 
poo, compared to pool of normals PM.0: 10566975, 9769367, 

9583733,10514828. 

3.1.5 Opa-interacting protein OIP3 
(Accession:gi|2815605|gb|AF025439.1|AF025439) 

Opa proteins are a famiiy o, outer membrane proteins involved ,n 
oonococoa, adherence to and invasion o, human ce„s. Pyruvate Kinase M2 is 
0,P3 which binds to OPA proteins. Modulation o. type U2 pyruvate Kinase 
activity by the human papillomavirus type 16 E7 oncoprotein has been 
demonstrated. PMID: 9990017. 9692838 

I - r ir ■ ^W ro " tiation of - 

Iroratinocvtes 

3.2.1. Keratins: keratin 19, keratin 7, keratin 8, keratin 18, 
m keratin 17 

Keratins, or cytoKeratins. represent a famiiy of more than 20 
different polypeptides which are important marKers of epithelial cel, 
differentiation. Both gene expression and protein ,eve,s are elevated (and even 
used as a marKer) in several pathoiogicai conditions including breast cancer, 
.dney tumors, small cel, lung cancer (SCLC), and pre-eclampsla. Measurements 
of cytoKeratins 19 and 20 fcvels in serum and urine are in use es u« marKer 
for bladder cancers. 
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Keratin 18 is a type-l keratin that is found in a variety of simple 
epKhelia, tissue, Pancreatic exocrine acinar ceiis and endocrine isie, cei,s are 
^-differentiated cells which express the Keratin combination 8 and 18, whereas 
„. less-drfferentiated ceiis o, the ducta, tree are characterized hy the additionai 

and 20 are increased in recta, adenocarcinoma and Pagefs disease. PMK>: 
,0755601, 10707834, 10782894, 10775728, 10762743, 9614373, 8911513, 
9445193. 2434380 

i 3.2.2. S100 proteins: S100A11, S100P, S100 Calcium binding 

protein A13, S100A6 

' S100 proteins are low-molecular-weigh. calcium-binding protems of 

t „e EP-hand superfamily and appear to be involved in the relation of a number 
S of ceilular processes such as cel, eye, progression and dWerentiatio, More man 

1 10 members of the S100 protein family have been described from human 

sources so far. Induced expression in tumors of some of these genes has been 
reported. 

30 S100A1 1 (or S100C/ Calgizzarin) 

Calgizzarin is a nuclear protein which MM. the actin-activated 
myosin M g(2 + )-ATPase activity of smooth muscie in a dose-dependen, manner. 
Cher Ca(2 + )-binding proteins such as S100A1 , S100A2. S100B, and ca,modu«n 
do no. inhib* actin-activated myosin M g(2 + )-ATPase activity. Caigizzarin can be 
involved in the regulation of actin-activated myosin M g(2 + )-ATPase activ«y 
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through *s Ca(2-)-dependen, interaction w«h actin filaments, it is expressed in 
mos, issues and ce,. iines. and co-localized wKh the psoriasin gene S100A7 and 
other S1O0 genes to human chromosome 1q21-q22. 

Calgizzarin was found to be remarkably elevated in colorectal 
cancers compared w,,h that in norma, colorectal mucosa. No similar alteration in 
expression was detected in breas, cancer. PMID: 10 4 8 6266, 10623577. 
7591220, 7889529 

S100P (Accession: gi|5174662|ref|NM_005980.1|) 
S10 0P overexpression is an early event that might play an 
import an, ro,e in the immor,a,*a,ion o, human breas, epithelial cel. in vitro and 
tumor progression in vivo. S100P expression was donated after remova, 
androgen from LnCAP prostate cancer cel, line. PMID: 10639564. 8977631 



S100 Calcium binding protein A13 
(Accession-. gi|5174658|ref|NM_005979.1l) 
S1 00A1 3 was found to be widely expressed in various types of 
liss ues including skeletal muscle, heart, kidney, ovary, small intestine and 
pancreas. Itwas shown to bind anti-allergic drugs and thus to be involved in the 
HMcn of degranulation of mas, ceils. Also, it was shown to be involved in the 
regulation 0.FGF1 aCivKy. PMID: 10722710, 8878558. 9712836, 10051426 
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Growth factor inducible 2a9/ca.cyclin/S100A6 (Accession. 

M14300) 

2A9 was isolated from stimulated quiescent fibroblasts. I, is induced 
by growth factors and over-expressed in AML. S100A6 was also suited to be 
involved in the progression and invasive process of human colorecta. 
adenocarcinomas. PMID: 10656447, 1952954 

0 

3.2.3 PA-FABP - Fatty acid binding protein 5 (psoriasis- 

15 associated) 

S (Accession: gi|4557580|ref|NM_001444.1|) 

I The fatty acid-binding protein (FABP) family consists of small. 

^ oytosolic proteins believed to be involved in the uptaKe. transport, and 

55 solubilization o, their hydrophobic ligands. PA-FBP can be invo.ved ,n 

1 Keratinocyte dtferentiation. In norma, *n. PA-FABP is expressed in basa, and 

pr,c kte oel, layers, and more strong, in the granuiar cei, layer. In psoriatic sKin. 
PA-FABP is expressed in suprabasal layers and more strongly in more 
differentiated Keratinocytes. In squamous cell carcinoma, PA-FABP shows very 
strong expression in squamous nests. Serum levels of intestinal fa«y acd- 
binding protein (,-FABP, sen,e as diagnostic marxerfor mesenteric infarction 
(acu te ischemic diseases o, the bowel). Expression of PA-FABP has been linKed 
to squamous cel, carcinoma of the biadder. PM.D: 9521644, 9438903, 8566578. 
40 8092987, 9307301 
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., - , -..^.H in cc" — "'»Y """iteration 

3.3.1 Actin gamma 1 (Accession: gH4501886lref1NM_001614.il 

B ||4501 886|ref|NM_001 614.1 1) 

Ubiquitously expressed in all eukaryotic cells. Beta and gamma 
actins co-exist in most cell types as components of the cyloskeleton and as 

mediators of internal cell motility. 

3.3.2 37 kD laminin receptor precursor/p40 ribosome 

associated protein 

(Accession: HSU43901) 

The 37 kD precursor of the 67 kD laminin receptor (37LRP) is a 
pdyoeptide whose expression is consistently up-reguiated in aggressive 
carcinoma. Interestingly, the 37LRP appears to be a multifunctional protein 
involved in the translationa. machinery and has also been identified as P 40 
riD osome-associated protein. It is distributed on the cel. surface as laminin 
binding protein P 67 (LBP/p67), in the nucleus, and on 40S ribosomes. PM.D: 

8760291. 10079194 

Throughout this application, various publications, including United 

States patents, are referenced by author and year and patents by number. Ful, 

citations for the publications are listed below. The disclosures of these 

publications and patents in their entireties are hereby incorporated by reference 

81 



into this application in order ,o more My describe .he state of the «t to which 

this invention pertains. 

The invention has been described in an illustrative manner, and it is 

to be understood that the terminology which has been used is intended to be in 
the nature of words of description rather than of limitation. 

Obviously, many modifications and variations of the present 
invention are possible in light of the above teachings. It is, therefore, to be 

I understood that within the scope of the described invention, the invention can be 

II practiced otherwise than as specifically described. 
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TCTT CAGCAuCACAAACTCATTCT CAGCAGCTGT GCGGCGvj ± ^ — 
TCTTCG \ 

>04 TCC 94c\_Ml3F. TXT. fa , constant = J^gg^^cTGAGCCGCC 
^GGCTTMTcfecCGGACCGCATCCG^ 

>2 0 TCC 60H4_M^F.TXT.fa , constant: g ?StCTTCTGGGTAGAGGGAT 
CCAAGGNAAAANA 

>2 6 TCC_4 4C1_M13 k TXT^f a , ccns tant^ -^^^cTGCTCATCAN 
ACTCATTGAACTTGAGC^CGANTCCT^ 



cngtcncaatggantcctcYagctct^gtca^tc 

CCAAAGCTGATGTCCTCAA^GGCTA^ 




ATGT 

>31 TCC_6SB9_H13F TXT f\ , ^ t K t ctJctXctCC M CCAT 

ScS^SctSct^™ 
cactttgttaaaacatgaaaaaaaaa 

>47 TCC_91B11_M13F TXT ;A=«^;J^^^cS^SACCa 

^Sggg^ctS 

>10 TCC_53H11_T3 m ^^_,__ rr A T ^ TTcn .«; TTTT cCTCTTNCCTTCTTAACT 

ttttttj^thttat^ 



atgcaccacttc^tttccc^ttgt^c^^at^^ctt^a^^^^^^^^^^^^ 
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>U TCC_25F2_MllF.TXT , co"«n^ ^, P^^tcagACTC-CTGCA 
ACCCTGGGAGAGAAg4t?GAAGA^ 




>26 TCC_50G6_Ml4TXT ^ atSS^TAAtS^CAACAACTATT 

AT ACTAC CATAGT GAG C I AT GAT T T T CT AAAAAAAAA 
>26_TCC„75^-13jU 032^ 




>30 TCC.76B3JQ3T M ^S^^!^^ 1 ^^ 
AAAGTCATCCTCCGTCTACtAGAGCGTGC^ 

l^AAAATAGATATATiTTTAAAATCCTTAAAAAAAAA 

>38 TCC_56E11_M13F tW , ^ t ^cCTGCaSaAACACTCCACC 

C CT GAT GT C AAT AAAC AACA<ICT GAAGC AAAAAAAAA 
>46_TCCJ78B11 M13^ 

>53_TCC_79G2 M13F E07U 

SccaSStStaaagagcSgtaacatca^ 

>82 TCC_89G3_M13F Bll U c? l c T S 
CAGGAGACCATCCGCGTCACCA^CCC^ 

AAGAAAGGGAAGGGAAAGGACT^ACCCCAA^oi qtgACCCACCAGTGCCTT 
CATGGGGCCTGGCCCACGCCCTCb^ 

ctgtctc^tcgttagctttaatc^ 

CA^AT^AAATA^-CTGACTTTTTuCCCCCAAAAAAAAA 
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secuencs : :?c 
CTTTGACGT 
GGGGCAATAAGA 



_C05_G2' 
[length 



wed Aug * 
(high qua.--* 



aw sequence: ESS 

; J ° -^J J^c^GC A? T AACTT CATCT AT GGAGC-CT C-CC 
'.CAGCTAC C GCT C? GAGGAGGC CT GC AT GCT CC GC T GC 



12 : 4 S : 2 ! 



ISctc^-a^ 
t^cctgtScgIc^ 
ctatgtgtgagc A 

> 46 TC- 27H5 m\3F F06 058.abl.fa TIME : Wed Aug 9 12 : 48 : 35 20 00 
tx^ini inforltlon :-ra W _se q uence:892 (h,gh qualxty: 169-406) 
seguence:170-28\ ^^ CCTTTAGCCC CTAAAACAACA 

AAAT AT TT GT CAGGAGffiG 
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aagac?gc^---- : — ^ 

>57_TCC_sl M13F^0S OS^TXT G ^^ s iJ i( 5^ S ?S 
CTGCAGGAGTCfcGCGT . CAA, C .TGACC - - ^"1 T( - TTC ATTGT AGAGAATTTGGAAA 

sequence:98-39^[len^^ 

GACAACACAAAGTATtexTGTTTUli^i CSAAGA r T ATTTTCCTTATTA 
CTTTGCCAGCCTCAGCTTAATC^GGAGACAAAGA^TA ^^^^^^^ 

sequence : 98 - 47 ^^fc^i^c?iTAGGATTAACTTGTAAAAAACA 
CTATGAATAGCTTCTTGC^^ACTTTAGGA^ 

TATCCTGAACTAAGATATGgAAATACTCATTl GAGAGTG TGTA 

CCATCTTGCAAATCATTTTAT^TCTCAi TTTTATCTTTTAAA 

^StS^^ 
ATTT AT TAT T AGACAT T ACT AC^AAAAGG 

>^Sffi%^- ! '- 2000 

GGAGAGAGGAGTG \ 
sequence : 95-602 t len ^J;508] \ 

^StSJStgtgSS^ 

GAAAGAATGCCTTGTGATTAlui GCTGCACCTGC GTTTTTA 
GAGAATGCCTCATAACCCACTGA-TCT 

^ T ^S™g T ca5acc?c™ 

TATGTTCTCAGGGAAATGCTTAGGTGGTGTCAdAAAAAGGvaCCTTT ±. CTT 



TNCTTTNC 
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— At-— Q-d TIME: Wed Aug 9 12: : 3 \ii. 

^^nal^^rr^e^=e:36i (high .ual^y:--.^ 

^^^V^^^GGAA^TGACAATTATAAATAGTAATAGTG 



\ n« »M - ttme : wed Aug S 12:48:27 20C0 

>54_,CC_30Z3 jdblF -^-^ " ^ ^e^ 336 (high quality: 65-254) 
trimming mroatLion. IjTt 

\ -w-, ttme • Wed Aug 9 12:48:39 2000 

TTCCAAAGTCTACCACACffCAAGAA^ii rTTCCA ACCTTTGCACAGTCC 
AAGTCACCATTTTAGAAATGCAAACCCACTTCCAACi-i * 

>w»?^sa?^ , ^.sr , «K 2000 

trimming information, raw b H 

sequence : S7 - 44 ^Ii^SrAGTCAGATTCTATAAATGGCAAGTAA 
CATTTTTAGTGACATTTTAARAGCAGT^ 

GCCTGAAGTGAGGATACTGC^TTTTCGGAGAAAA TCATTTTGC 
AAGTGTTTGCATTTTCTATT^GGGGGCAGGCAACTbl TAAGTCT 
ACAATTCTTGAACTGATGTCAfccACC^ 
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Table 5 : Su 



fcss . of 22 genes identified as potential TCC markers 




TCC-71E4 



TCC-94G3 
TCC-70E7 
| TCC-21G7" 

TCC-93G5" 
TCC-36B5 

t 



| TCC-70E8 . 



-si5Hi^TQ2 from Paten. WoSSpjT 

HumanmRNA fragment tar mesothelia. we >™ 

nt ~ - npnnmiriidenlir fV— "1 Tr - 9453 

-HSS7_TCC_70E7_1_M13Kta 

nnnr .-7 tpp. 70E7 1 M13R.fa 

HoWsapiens done PP722 unknown mKNA, 
CO^IG_nt_nonjenomic(idenUty):cont.g_TCC_21G7_Kr- 

nt n nn npnomielidenti rrVrnnn Tf>r - mG5 K| - ' a 

SecTe'nce 5 from Patent muSMi ; 
m.nonjenomic(identity):conug_TCC.36B5_RF.la 

fcfew S100 ca»dmg .fam P MS OOP . 
nt ^ miriid e ntitv):cont.g TCC 34A5 _ RF.f a 

W-™^ Trr - 70EB KFfa 



Homo sapiens mRNA for 
hepatocyte growth factor 
activator inhibitor type 2. 
complete cos_ 



Homosaoiens hypothetical 
protein (LOC51323), mRNA 



3i|100422A4lerr.DiA 
X017422.MAX0 4 .7i 

23, 

giilOQ4<l7Q|&nriD|A x 
. X014803.1IAX0149 

j 03 

gi|34067iemDlX032 ; * 
1?.1[HSKER7R_ 



gi|1044l985lgD(Ai-Z 
18028.1|AF218028 

gi(7263011|gblAhZU i x 
R734.11AF208234 _ < 
gt|10040588lemblA 
XD14141.1|AXD141 

41 

g i|4501886|ref|NM. 

001614.11 



g i|5174662|reflNM_ 
0059B0.1I 




nnne:conug._ ■ ^ ' - •;- 

S^eTTce 56 from Patent vvu^o4353 
n Lnonjenomic(identity):contig_TCC.78B1 l .RF.ta 



Human growth tactor- 
induable 2A9 gene, 
complete cds . 



■ ™* -PsttidenutWctintin T CC 1Q2C5„R ■& __ 

— . <-t /l/OT'^7^ mRNA' 



ot nnn ^nomididen t^ -"''n TCC 58A3 RF-fa 

„. nnn ^nomididen tfrfl— ''H ™C 57B3 Rr Ja 

onnmididentityV.c on .g ^ 

"Homo sapiens keratin 8 (KRTb) mKNA ; 



gi|770578Blret\NM. 
016025.11 



gi]6879340igb|AW3 
7ARflfi.1 |AW374686_ 
gi|30378|emb(2195 
74 1]HSCYTOK17_ 



Homo sapiens keratin 8 (KRTB) mRNA , 
nt_non^enomic<identity):contig„TCu_99G12_RF. l a 



nt ^ npnnmididentit Yl-"^ TCC 92D7_Kr.ja 

Sequence 82 from Patent WUWM,27 ; 
ntTnon^enomic(identity):contig.TCC_89 G3.RF.fa 

HoTr^^^ 

m nnn nenomidid rnttty)— *1 TCC 5BE11_^.ta 
-§55588 from Patent W099b^ ; 
nt nonjenomic(identity)xontig.TCC_25F2.RF.ta 




Homo sapiens midkine 
(neurite growth-promoting 
f a Mnr9 )fMDtO mRNA 



TCC-44C1 



-Homo sapiens Si 00 caicium^inding protein * i* w -7 
nt nnn aen omidinpntitv\:contio,TCC.44C^RF.fa^ 



Homo sapiens fatty acid 
binding protein 5 (psonasis- 
a^nriatecft fFABPSV mRNA 



gi|8924228|reflNM_ 
niflfi36.1| 
qi|10041391lemblA 
XD15411.1IAXD154 

11 

gil2815605|gb|AF02 

M9.11AF025439_ 

gil10041157|emblA 

X014890.1(AX0148 

90 



au — — 

gi|517465B|reflNM_ 
005979.11 



Mntes to Table 5 

1} ,n coiumn A y indicates mat the sequence aiso appears in Tabie 1 or 2. 
2) TaW e 5 includes known genes whose function in bladder cancer was 
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•r 



heretofore unknown and which were now found to upregulated in bladder 
oancer (identified by Accession Number) and also inciudes sequences of 
novel genes which have no identity to Known proteins or genes in the gene 



databases 



10 



G 

® 

lSI 

m 

m 
m 
ru 

m 

ru 

O 
2I 1 



30 



35 



40 



45 
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10 



20=, 



251 



30 



35 



_>I7_TCC_7 0d7_l Ml 35 ^ " - CC , GGTG GATCTCCTGTTTTAC 

AAAGGA \ 

^agtaaaIgac^gagacacctggtgaagactggaacgcatg.acgtc, 

ACC \ 

>contig_TCC_l4lEll J« _ TrTcr . ATTCAATTC CAAATACTGGT 

OT ^ T ^ TTOTTT< ^ TG 

>contig_TCC 102*5 GGTGGCACG GTGCACGCCAA 

agaaaaaaaaaaaaaaaa\ 

>contig_TCC_25F2_RF.fa \ ^ TGGCAGAAAA ^ iC TCAG 
TGGAGTGTGTCATGAACAATGTCACiCTG 

TTTGAGTCCTGCCTTCTCAAAG \ 

* 97 



40 



45 



CCTTCCG te-rT^G^GAAGA-AGAGCCCATTGTGCTGTCTCTCCA 
CATC-TGAT^GTTGTGTG.-TGAA.A ^.^cCTTCTCCA 
2C-ACACGTTteTGT.^oTTCA.AGAG^ TCTC AGATGACTGTGAA 

GAGGCCACTTCCAAGGATGCTGoAGAGTCTCTG^ 
GTTTGCACCTCTGCAGGCC^o^AC^ATGATGA 

.fcACAfcX™ 
GATGCCATG^GACATGGG 

>contig_Tci 44Cl_RF.fa TTCACATCC AAGCTCTTCATCT 
CTCATTGAAC*GAG.TCCGAGTCCTGAT^ 

TCTCATCAAGAfiAGCCCACATCCTTGAGCA^ 

ACCAGCTCTTT^CTCGTTGACGCTG^CTATCCTl 

GCTCTGTCAGT -^I^'S^SS?AGGGCTGACCTCTCCTCAGCGGC 



AGGACGGGTTTAG 
>contig_TCC_54fcll RF.fa TTTATT TTTCCTTACACA 

TTTTTT ttttttttJtggt^cgg^ 

AAAAAAA \ 

>contig_TCC_56EM R^.fa TCAGCTGTCC TGCAGCAA 
CTCTCCAGTTTGCAC^GT^ 

AAAAAAAAAAAAAAA \ 

>contig_TCC_57B3 g.fa TGTTGTGTATAG ATGCAAGA 
GGTACGACGGACCTGCG^GACTCCTGCCCT^ CAGACACTAAGTTAT 
TATTTATATATATTTTTGfeTTGTCAA^TTW 

A \ 

>contig_TCC_58A3 ^\fa TRCCGCCGCC TGCTGGAGGGAGAGGAT 



AGGTCCACCSGSCCiCCCGCtGMGaCTCR^ tctc 





15; I 



2b: 



2511 



30 



AC. 

lK^Cr'^^^r?Trr'i''T'CCCACGCTGGTACTTTGACGTGGAGAGvj 

- V - CTCC ^^;^S^GC^GCTCCGCTGCTTCCGCCAGCAGGAGA 
CTACCG ^;;S^^G-^CA^AGa-GGTGGTTCTGGCC-GGGCTGTTC 

atcct ^^;S^;^;ct-cc^ggLcctccatggtctacctgatccg 

TTTCTCTGGG^GTAGGMGGCT^ 

GCTTTGGAAA^CTCTAGoAGGCTCCTC^ 

AGCAGCCCCGA^TGTTTCCTCGCTGATCCATTTCT^ 

TGATG "^^SS?aSIgcattctgaaagaaggaaagtaaaatgta 
cxttcaaaaaaaaaAaaaaaaaa 

>c°n ti g- T ^-"^^-™ ATCCTC CTGCftGACCACGCCCGTCCT 

aaagagggcggcaggg^ctggagatcctcl TTGA cgaggg 

GCCTGTGGCGCCGTCTd^GGKTGCT-CCTC^i 
CCGTTAGCTCCTGTGGC^CACCCTG^ 

ataaagtaaaaaaaaaaaaAaaaaa 

>contig_TCC 78B11 RF\f^ CCAGGAGTATGTC ACCTTCC 

AAAAAAA \ 

CCAAGGAGCCCCTGuTGTLAUAi^u£ 

CAGGCCCGAGATGTGACCCACCAGTGCCTTCTGT^ 

ATCAATCATGCCCTGCCTTGiCCCTC^ACTCCC^Ab 




35 



AAAAA 

CCAGTCAGTGGTTGAAGTC \ 
>contig_TCC_93G5_RF. f a 



40 



45 



CAP -A i 1 jJ^AAAAAAAA 

^■ G °^;X;^rrACCAC^CCACTCCTCCAGCCACCACCCACAATCACA 
C"GAGCCGt x TCCCACCAC : C x A. ^C-ACAGTGAGACA 

GGCCTGAGTfeAAAAAAAAAAAAAAAAAAA 

.SS^feSGfecraGCTCGGCCTATGGGGGCCTCAC^GCCC 



GGG 



TTTTTTTTTGGTCC. 



The first two seLnces are from opposite ends of the same polynucleotide, (and 
are thus in the saiie gene) . All the other 21 sequences are contigs. 
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